

©WILEY 


Design and Analysis 
of Experiments 

Volume l 

Advanced Experiraenta] Design 


K l a w 其 II i n k e l m a n n 
O fi c ff r K if m p t h o r n e 


IVi f-p ^ c r J -r ■ T h f et« tali' iJ if j ^ 1 iTUiif Sfafiflllc-fr 





Design and Analysis 
of Experiments 



WILEY SERIES IN PROBABILITY AND STATISTICS 

Established by WALTER A. SHEWHART and SAMUEL S. WILKS 

Editors: David J. Balding, Noel A. C. Cressie, Nicholas I. Fisher, 

Iain M. Johnstone, J. B. Kadane, Geert Molenberghs, Louise M. Ryan, 

David W. Scott, Adrian F. M. Smith, Jozef L. Teugels 

Editors Emeriti: Vic Barnett, J. Stuart Hunter, David G. Kendall 


A complete list of the titles in this series appears at the end of this volume. 



Design and Analysis 
of Experiments 

Volume 2 

Advanced Experimental Design 


KLAUS HINKELMANN 

Virginia Polytechnic Institute and State University 
Department of Statistics 
Blacksburg, VA 

OSCAR KEMPTHORNE 

Iowa State University 
Department of Statistics 
Ames, IA 


® WILEY- 

INTERSCIEHCE 

A JOHN WILEY & SONS, INC., PUBLICATION 



Copyright © 2005 by John Wiley & Sons, Inc. All rights reserved. 

Published by John Wiley & Sons, Inc., Hoboken, New Jersey. 

Published simultaneously in Canada. 

No part of this publication may be reproduced, stored in a retrieval system, or transmitted in any 
form or by any means, electronic, mechanical, photocopying, recording, scanning, or otherwise, 
except as permitted under Section 107 or 108 of the 1976 United States Copyright Act, without 
either the prior written permission of the Publisher, or authorization through payment of the 
appropriate per-copy fee to the Copyright Clearance Center, Inc., 222 Rosewood Drive, Danvers, 
MA 01923, 978-750-8400, fax 978-646-8600, or on the web at www.copyright.com. Requests to 
the Publisher for permission should be addressed to the Permissions Department, John Wiley & 
Sons, Inc., Ill River Street, Hoboken, NJ 07030, (201) 748-6011, fax (201) 748-6008. 

Limit of Liability/Disclaimer of Warranty: While the publisher and author have used their best 
efforts in preparing this book, they make no representations or warranties with respect to the 
accuracy or completeness of the contents of this book and specifically disclaim any implied 
warranties of merchantability or fitness for a particular purpose. No warranty may be created or 
extended by sales representatives or written sales materials. The advice and strategies contained 
herein may not be suitable for your situation. You should consult with a professional where 
appropriate. Neither the publisher nor author shall be liable for any loss of profit or any other 
commercial damages, including but not limited to special, incidental, consequential, or other 
damages. 


For general information on our other products and services please contact our Customer Care 
Department within the U.S. at 877-762-2974, outside the U.S. at 317-572-3993 or fax 
317-572-4002. 


Wiley also publishes its books in a variety of electronic formats. Some content that appears in 
print, however, may not be available in electronic format. 

Library of Congress Cataloging-in-Publication Data is available. 

ISBN 0-471-55177-5 

Printed in the United States of America. 


10 987654321 



Contents 


Preface 

1 General Incomplete Block Design 

1.1 Introduction and Examples, 1 

1.2 General Remarks on the Analysis of Incomplete Block 
Designs, 3 

1.3 The Intrablock Analysis, 4 

1.3.1 Notation and Model, 4 

1.3.2 Normal and Reduced Normal Equations, 5 

1.3.3 The C Matrix and Estimable Functions, 7 

1.3.4 Solving the Reduced Normal Equations, 8 

1.3.5 Estimable Functions of Treatment Effects, 10 

1.3.6 Analyses of Variance, 12 

1.4 Incomplete Designs with Variable Block Size, 13 

1.5 Disconnected Incomplete Block Designs, 14 

1.6 Randomization Analysis, 16 

1.6.1 Derived Linear Model, 16 

1.6.2 Randomization Analysis of ANOVA Tables, 18 

1.7 Interblock Information in an Incomplete Block Design, 23 

1.7.1 Introduction and Rationale, 23 

1.7.2 Interblock Normal Equations, 23 

1.7.3 Nonavailability of Interblock Information, 27 

1.8 Combined Intra- and Interblock Analysis, 27 

1.8.1 Combining Intra- and Interblock Information, 27 

1.8.2 Linear Model, 27 

1.8.3 Normal Equations, 28 

1.8.4 Some Special Cases, 31 



vi 


CONTENTS 


1.9 Relationships Among Intrablock, Interblock, and Combined 
Estimation, 31 

1.9.1 General Case, 32 

1.9.2 Case of Proper, Equireplicate Designs, 34 

1.10 Estimation of Weights for the Combined Analysis, 36 

1.10.1 Yates Procedure, 37 

1.10.2 Properties of Combined Estimators, 38 

1.11 Maximum-Likelihood Type Estimation, 39 

1.11.1 Maximum-Likelihood Estimation, 39 

1.11.2 Restricted Maximum-Likelihood 
Estimation, 40 

1.12 Efficiency Factor of an Incomplete Block Design, 43 

1.12.1 Average Variance for Treatment Comparisons for 
an IBD, 43 

1.12.2 Definition of Efficiency Factor, 45 

1.12.3 Upper Bound for the Efficiency Factor, 47 

1.13 Optimal Designs, 48 

1.13.1 Information Function, 48 

1.13.2 Optimality Criteria, 49 

1.13.3 Optimal Symmetric Designs, 50 

1.13.4 Optimality and Research, 50 

1.14 Computational Procedures, 52 

1.14.1 Intrablock Analysis Using SAS PROC GLM, 52 

1.14.2 Intrablock Analysis Using the Absorb Option in 
SAS PROC GLM, 58 

1.14.3 Combined Intra- and Interblock Analysis Using the 
Yates Procedure, 61 

1.14.4 Combined Intra- and Interblock Analysis Using 
SAS PROC MIXED, 63 

1.14.5 Comparison of Estimation Procedures, 63 

1.14.6 Testing of Hypotheses, 66 

2 Balanced Incomplete Block Designs 71 

2.1 Introduction, 71 

2.2 Definition of the BIB Design, 71 

2.3 Properties of BIB Designs, 72 

2.4 Analysis of BIB Designs, 74 

2.4.1 Intrablock Analysis, 74 

2.4.2 Combined Analysis, 76 

2.5 Estimation of p, 77 



CONTENTS 


vii 


2.6 Significance Tests, 79 

2.7 Some Special Arrangements, 89 

2.7.1 Replication Groups Across Blocks, 89 

2.7.2 Grouped Blocks, 91 

2.7.3 a-Resolvable BIB Designs with Replication 
Groups Across Blocks, 96 

2.8 Resistant and Susceptible BIB Designs, 98 

2.8.1 Variance-Balanced Designs, 98 

2.8.2 Definition of Resistant Designs, 99 

2.8.3 Characterization of Resistant Designs, 100 

2.8.4 Robustness and Connectedness, 103 

3 Construction of Balanced Incomplete Block Designs 

3.1 Introduction, 104 

3.2 Difference Methods, 104 

3.2.1 Cyclic Development of Difference Sets, 104 

3.2.2 Method of Symmetrically Repeated 
Differences, 107 

3.2.3 Formulation in Terms of Galois Field Theory, 112 

3.3 Other Methods, 113 

3.3.1 Irreducible BIB Designs, 113 

3.3.2 Complement of BIB Designs, 113 

3.3.3 Residual BIB Designs, 114 

3.3.4 Orthogonal Series, 114 

3.4 Listing of Existing BIB Designs, 115 

4 Partially Balanced Incomplete Block Designs 

4.1 Introduction, 119 

4.2 Preliminaries, 119 

4.2.1 Association Scheme, 120 

4.2.2 Association Matrices, 120 

4.2.3 Solving the RNE, 121 

4.2.4 Parameters of the Second Kind, 122 

4.3 Definition and Properties of PBIB Designs, 123 

4.3.1 Definition of PBIB Designs, 123 

4.3.2 Relationships Among Parameters of a PBIB 
Design, 125 

4.4 Association Schemes and Linear Associative Algebras, 127 

4.4.1 Linear Associative Algebra of Association 

Matrices, 127 


104 


119 



Vlll 


CONTENTS 


4.4.2 Linear Associative Algebra of 7 Matrices, 128 

4.4.3 Applications of the Algebras, 129 

4.5 Analysis of PBIB Designs, 131 

4.5.1 Intrablock Analysis, 131 

4.5.2 Combined Analysis, 134 

4.6 Classification of PBIB Designs, 137 

4.6.1 Group-Divisible (GD) PBIB(2) Designs, 137 

4.6.2 Triangular PBIB(2) Designs, 139 

4.6.3 Latin Square Type PBIB(2) Designs, 140 

4.6.4 Cyclic PBIB(2) Designs, 141 

4.6.5 Rectangular PBIB(3) Designs, 142 

4.6.6 Generalized Group-Divisible (GGD) PBIB(3) 

Designs, 143 

4.6.7 Generalized Triangular PBIB(3) Designs, 144 

4.6.8 Cubic PBIB(3) Designs, 146 

4.6.9 Extended Group-Divisible (EGD) PBIB 
Designs, 147 

4.6.10 Hypercubic PBIB Designs, 149 

4.6.11 Right-Angular PBIB(4) Designs, 151 

4.6.12 Cyclic PBIB Designs, 153 

4.6.13 Some Remarks, 154 

4.7 Estimation of p for PBIB(2) Designs, 155 

4.7.1 Shah Estimator, 155 

4.7.2 Application to PBIB(2) Designs, 156 

5 Construction of Partially Balanced Incomplete Block Designs 158 

5.1 Group-Divisible PBIB(2) Designs, 158 

5.1.1 Duals of BIB Designs, 158 

5.1.2 Method of Differences, 160 

5.1.3 Finite Geometries, 162 

5.1.4 Orthogonal Arrays, 164 

5.2 Construction of Other PBIB(2) Designs, 165 

5.2.1 Triangular PBIB(2) Designs, 165 

5.2.2 Latin Square PBIB(2) Designs, 166 

5.3 Cyclic PBIB Designs, 167 

5.3.1 Construction of Cyclic Designs, 167 

5.3.2 Analysis of Cyclic Designs, 169 

5.4 Kronecker Product Designs, 172 

5.4.1 Definition of Kronecker Product Designs, 172 



CONTENTS 


ix 


5.4.2 Properties of Kronecker Product Designs, 172 

5.4.3 Usefulness of Kronecker Product Designs, 177 

5.5 Extended Group-Divisible PBIB Designs, 178 

5.5.1 EGD-PBIB Designs as Kronecker Product 
Designs, 178 

5.5.2 Method of Balanced Arrays, 178 

5.5.3 Direct Method, 180 

5.5.4 Generalization of the Direct Method, 185 

5.6 Hypercubic PBIB Designs, 187 

6 More Block Designs and Blocking Structures 

6.1 Introduction, 189 

6.2 Alpha Designs, 190 

6.2.1 Construction Method, 190 

6.2.2 Available Software, 192 

6.2.3 Alpha Designs with Unequal Block 

Sizes, 192 

6.3 Generalized Cyclic Incomplete Block Designs, 193 

6.4 Designs Based on the Successive Diagonalizing 
Method, 194 

6.4.1 Designs for t = Kk, 194 

6.4.2 Designs with t = n 2 , 194 

6.5 Comparing Treatments with a Control, 195 

6.5.1 Supplemented Balance, 196 

6.5.2 Efficiencies and Optimality Criteria, 197 

6.5.3 Balanced Treatment Incomplete Block 
Designs, 199 

6.5.4 Partially Balanced Treatment Incomplete Block 
Designs, 205 

6.5.5 Optimal Designs, 211 

6.6 Row-Column Designs, 213 

6.6.1 Introduction, 213 

6.6.2 Model and Normal Equations, 213 

6.6.3 Analysis of Variance, 215 

6.6.4 An Example, 216 

6.6.5 Regular Row-Column Designs, 230 

6.6.6 Doubly Incomplete Row-Column Designs, 230 

6.6.7 Properties of Row-Column Designs, 232 

6.6.8 Construction, 237 

6.6.9 Resolvable Row-Column Designs, 238 


189 



X 


CONTENTS 


7 Two-Level Factorial Designs 241 

7.1 Introduction, 241 

7.2 Case of Two Factors, 241 

7.2.1 Definition of Main Effects and Interaction, 241 

7.2.2 Orthogonal Contrasts, 243 

7.2.3 Parameterizations of Treatment Responses, 244 

7.2.4 Alternative Representation of Treatment 
Combinations, Main Effects, and Interaction, 247 

7.3 Case of Three Factors, 248 

7.3.1 Definition of Main Effects and Interactions, 249 

7.3.2 Parameterization of Treatment Responses, 252 

7.3.3 The x -Representation, 252 

7.4 General Case, 253 

7.4.1 Definition of Main Effects and Interactions, 254 

7.4.2 Parameterization of Treatment Responses, 256 

7.4.3 Generalized Interactions, 258 

7.5 Interpretation of Effects and Interactions, 260 

7.6 Analysis of Factorial Experiments, 262 

7.6.1 Linear Models, 262 

7.6.2 Yates Algorithm, 263 

7.6.3 Variances of Estimators, 265 

7.6.4 Analysis of Variance, 265 

7.6.5 Numerical Examples, 267 

7.6.6 Use of Only One Replicate, 278 

8 Confounding in 2 n Factorial Designs 279 

8.1 Introduction, 279 

8.1.1 A Simple Example, 279 

8.1.2 Comparison of Information, 280 

8.1.3 Basic Analysis, 280 

8.2 Systems of Confounding, 283 

8.2.1 Blocks of Size 2 n ~\ 283 

8.2.2 Blocks of Size 2 n ~ 2 , 283 

8.2.3 General Case, 285 

8.3 Composition of Blocks for a Particular System of 
Confounding, 289 

8.3.1 Intrablock Subgroup, 289 

8.3.2 Remaining Blocks, 290 

8.4 Detecting a System of Confounding, 291 



CONTENTS 


xi 


8.5 Using SAS for Constructing Systems of Confounding, 293 

8.6 Analysis of Experiments with Confounding, 293 

8.6.1 Estimation of Effects and Interactions, 293 

8.6.2 Parameterization of Treatment Responses, 297 

8.6.3 ANOVA Tables, 298 

8.7 Interblock Information in Confounded Experiments, 303 

8.8 Numerical Example Using SAS, 311 

9 Partial Confounding in 2 n Factorial Designs 

9.1 Introduction, 312 

9.2 Simple Case of Partial Confounding, 312 

9.2.1 Basic Plan, 312 

9.2.2 Analysis, 313 

9.2.3 Use of Intra- and Interblock Information, 315 

9.3 Partial Confounding as an Incomplete Block Design, 318 

9.3.1 Two Models, 318 

9.3.2 Normal Equations, 320 

9.3.3 Block Contrasts, 322 

9.4 Efficiency of Partial Confounding, 323 

9.5 Partial Confounding in a 2 3 Experiment, 324 

9.5.1 Blocks of Size 2, 324 

9.5.2 Blocks of Size 4, 325 

9.6 Partial Confounding in a 2 4 Experiment, 327 

9.6.1 Blocks of Size 2, 327 

9.6.2 Blocks of Size 4, 328 

9.6.3 Blocks of Size 8, 328 

9.7 General Case, 329 

9.7.1 Intrablock Information, 330 

9.7.2 The ANOVAs, 330 

9.7.3 Interblock Information, 332 

9.7.4 Combined Intra- and Interblock Information, 333 

9.7.5 Estimation of Weights, 333 

9.7.6 Efficiencies, 334 

9.8 Double Confounding, 335 

9.9 Confounding in Squares, 336 

9.9.1 2 3 Factorial in Two 4x4 Squares, 337 

9.9.2 2 4 Factorial in 8 x 8 Squares, 338 

9.10 Numerical Examples Using SAS, 338 

9.10.1 2 3 Factorial in Blocks of Size 2, 338 

9.10.2 2 4 Factorial in Blocks of Size 4, 350 


312 



Xll 


CONTENTS 


10 Designs with Factors at Three Levels 

10.1 Introduction, 359 

10.2 Definition of Main Effects and Interactions, 359 

10.2.1 The 3 2 Case, 359 

10.2.2 General Case, 363 

10.3 Parameterization in Terms of Main Effects and 
Interactions, 365 

10.4 Analysis of 3 n Experiments, 366 

10.5 Confounding in a 3 n Factorial, 368 

10.5.1 The 3 3 Experiment in Blocks of Size 3, 369 

10.5.2 Using SAS PROC FACTEX, 370 

10.5.3 General Case, 374 

10.6 Useful Systems of Confounding, 374 

10.6.1 Two Factors, 376 

10.6.2 Three Factors, 376 

10.6.3 Treatment Comparisons, 376 

10.6.4 Four Factors, 379 

10.6.5 Five Factors, 379 

10.6.6 Double Confounding, 379 

10.7 Analysis of Confounded 3 n Factorials, 380 

10.7.1 Intrablock Information, 381 

10.7.2 The ANOVAs, 381 

10.7.3 Tests of Hypotheses, 384 

10.7.4 Interblock Information, 385 

10.7.5 Combined Intra- and Interblock Information, 386 

10.7.6 Estimation of Weights, 386 

10.8 Numerical Example, 387 

10.8.1 Intrablock Analysis, 387 

10.8.2 Combined Analysis, 387 

11 General Symmetrical Factorial Design 

11.1 Introduction, 393 

11.2 Representation of Effects and Interactions, 395 

11.3 Generalized Interactions, 396 

11.4 Systems of Confounding, 398 

11.5 Intrablock Subgroup, 400 

11.6 Enumerating Systems of Confounding, 402 


359 


393 



CONTENTS 

11.7 Fisher Plans, 403 

11.7.1 Existence and Construction, 403 

11.7.2 Identifying System of Confounding, 406 

11.7.3 Application to Fisher Plans, 407 

11.8 Symmetrical Factorials and Finite Geometries, 409 

11.9 Parameterization of Treatment Responses, 410 

11.10 Analysis of p n Factorial Experiments, 412 

11.10.1 Intrablock Analysis, 413 

11.10.2 Disconnected Resolved Incomplete Block 
Designs, 417 

11.10.3 Analysis of Variance Tables, 420 

11.11 Interblock Analysis, 421 

11.11.1 Combining Interblock Information, 422 

11.11.2 Estimating Confounded Interactions, 425 

11.12 Combined Intra- and Interblock Information, 426 

11.12.1 Combined Estimators, 427 

11.12.2 Variance of Treatment Comparisons, 430 

11.13 The s n Factorial, 431 

11.13.1 Method of Galois Field Theory, 431 

11.13.2 Systems of Confounding, 433 

11.13.3 Method of Pseudofactors, 435 

11.13.4 The (p\ x p 2 乂…乂 p m ) n Factorial, 443 

11.14 General Method of Confounding for the Symmetrical 
Factorial Experiment, 447 

11.14.1 Factorial Calculus, 448 

11.14.2 Orthogonal Factorial Structure (OFS), 452 

11.14.3 Systems of Confounding with OFS, 453 

11.14.4 Constructing Systems of Confounding, 457 

11.14.5 Verifying Orthogonal Factorial Structure, 459 

11.14.6 Identifying Confounded Interactions, 462 

11.15 Choice of Initial Block, 463 

12 Confounding in Asymmetrical Factorial Designs 

12.1 Introduction, 466 

12.2 Combining Symmetrical Systems of Confounding, 467 

12.2.1 Construction of Blocks, 467 

12.2.2 Properties of Kronecker Product Method, 469 

12.2.3 Use of Pseudofactors, 474 


xiii 


466 



xiv 


CONTENTS 


12.3 The GC/n Method, 477 

12.3.1 Description of the Method, 478 

12.3.2 Choice of Generators, 479 

12.3.3 Loss of Degrees of Freedom, 479 

12.4 Method of Finite Rings, 480 

12.4.1 Mathematics of Ideals and Rings, 481 

12.4.2 Treatment Representations, 483 

12.4.3 Representation of Main Effects and 
Interactions, 483 

12.4.4 Parameterization of Treatment Responses, 485 

12.4.5 Characterization and Properties of the 
Parameterization, 488 

12.4.6 Other Methods for Constructing Systems of 
Confounding, 491 

12.5 Balanced Factorial Designs (BFD), 491 

12.5.1 Definitions and Properties of BFDs, 493 

12.5.2 EGD-PBIBs and BFDs, 499 

12.5.3 Construction of BFDs, 502 

13 Fractional Factorial Designs 

13.1 Introduction, 507 

13.2 Simple Example of Fractional Replication, 509 

13.3 Fractional Replicates for 2 n Factorial Designs, 513 

13.3.1 The y Fraction, 513 

13.3.2 Resolution of Fractional Factorials, 516 

13.3.3 Word Length Pattern, 518 

13.3.4 Criteria for Design Selection, 518 

13.4 Fractional Replicates for 3" Factorial Designs, 524 

13.5 General Case of Fractional Replication, 529 

13.5.1 Symmetrical Factorials, 529 

13.5.2 Asymmetrical Factorials, 529 

13.5.3 General Considerations, 531 

13.5.4 Maximum Resolution Design, 534 

13.6 Characterization of Fractional Factorial Designs of 
Resolution III, IV, and V, 536 

13.6.1 General Formulation, 536 

13.6.2 Resolution III Designs, 538 

13.6.3 Resolution IV Designs, 539 

13.6.4 Foldover Designs, 543 


507 



CONTENTS 


XV 


13.6.5 Resolution V Designs, 546 

13.7 Fractional Factorials and Combinatorial Arrays, 547 

13.7.1 Orthogonal Arrays, 547 

13.7.2 Balanced Arrays, 549 

13.8 Blocking in Fractional Factorials, 549 

13.8.1 General Idea, 549 

13.8.2 Blocking in Designs, 550 

13.8.3 Optimal Blocking, 551 

13.9 Analysis of Unreplicated Factorials, 558 

13.9.1 Half-Normal Plots, 558 

13.9.2 Bar Chart, 562 

13.9.3 Extension to Nonorthogonal Design, 563 

14 Main Effect Plans 

14.1 Introduction, 564 

14.2 Orthogonal Resolution III Designs for Symmetrical 
Factorials, 564 

14.2.1 Fisher Plans, 564 

14.2.2 Collapsing Factor Levels, 567 

14.2.3 Alias Structure, 567 

14.2.4 Plackett-Burman Designs, 575 

14.2.5 Other Methods, 577 

14.3 Orthogonal Resolution III Designs for Asymmetrical 
Factorials, 582 

14.3.1 Kronecker Product Design, 583 

14.3.2 Orthogonality Condition, 583 

14.3.3 Addelman - Kempthome Methods, 586 

14.4 Nonorthogonal Resolution III Designs, 594 

15 Supersaturated Designs 

15.1 Introduction and Rationale, 596 

15.2 Random Balance Designs, 596 

15.3 Definition and Properties of Supersaturated Designs, 597 

15.4 Construction of Two-Level Supersaturated Designs, 598 

15.4.1 Computer Search Designs, 598 

15.4.2 Hadamard-Type Designs, 599 

15.4.3 BIBD-Based Supersaturated Designs, 601 

15.5 Three-Level Supersaturated Designs, 603 

15.6 Analysis of Supersaturated Experiments, 604 


564 


596 



xvi 

16 Search Designs 

16.1 Introduction and Rationale, 608 

16.2 Definition of Search Design, 608 

16.3 Properties of Search Designs, 609 

16.3.1 General Case, 609 

16.3.2 Main Effect Plus One Plans, 611 

16.3.3 Resolution V Plus One Plans, 614 

16.3.4 Other Search Designs, 614 

16.4 Listing of Search Designs, 615 

16.4.1 Resolution III.l Designs, 615 

16.4.2 Resolution V.l Designs, 616 

16.5 Analysis of Search Experiments, 617 

16.5.1 General Setup, 617 

16.5.2 Noiseless Case, 618 

16.5.3 Noisy Case, 625 

16.6 Search Probabilities, 630 


17 Robust-Design Experiments 


17.1 Off-Line Quality Control, 633 

17.2 Design and Noise Factors, 634 

17.3 Measuring Loss, 635 

17.4 Robust-Design Experiments, 636 

17.4.1 Kronecker Product Arrays, 636 

17.4.2 Single Arrays, 636 

17.5 Modeling of Data, 638 

17.5.1 Location and Dispersion Modeling, 638 

17.5.2 Dual-Response Modeling, 641 

18 Lattice Designs 

18.1 Definition of Quasi-Factorial Designs, 649 

18.1.1 An Example: The Design, 649 

18.1.2 Analysis, 650 

18.1.3 General Definition, 653 

18.2 Types of Lattice Designs, 653 

18.3 Construction of One-Restrictional Lattice Designs, 655 

18.3.1 Two-Dimensional Lattices, 655 

18.3.2 Three-Dimensional Lattices, 656 

18.3.3 Higher-Dimensional Lattices, 657 


CONTENTS 

608 


633 


649 



CONTENTS 


xvii 


18.4 General Method of Analysis for One-Restrictional Lattice 
Designs, 657 

18.5 Effects of Inaccuracies in the Weights, 661 

18.6 Analysis of Lattice Designs as Randomized Complete Block 
Designs, 666 

18.7 Lattice Designs as Partially Balanced Incomplete Block 
Designs, 669 

18.8 Lattice Designs with Blocks of Size K 1 ，670 

18.9 Two-Restrictional Lattices, 671 

18.9.1 Lattice Squares with K Prime, 671 

18.9.2 Lattice Squares for General K, 675 

18.10 Lattice Rectangles, 678 

18.11 Rectangular Lattices, 679 

18.11.1 Simple Rectangular Lattices, 680 

18.11.2 Triple Rectangular Lattices, 681 

18.12 Efficiency Factors, 682 

19 Crossover Designs 

19.1 Introduction, 684 

19.2 Residual Effects, 685 

19.3 The Model, 685 

19.4 Properties of Crossover Designs, 687 

19.5 Construction of Crossover Designs, 688 

19.5.1 Balanced Designs for p = t, 688 

19.5.2 Balanced Designs for p < t, 688 

19.5.3 Partially Balanced Designs, 691 

19.5.4 Strongly Balanced Designs for p = ? + 1, 691 

19.5.5 Strongly Balanced Designs for p < t，692 

19.5.6 Balanced Uniform Designs, 693 

19.5.7 Strongly Balanced Uniform Designs, 693 

19.5.8 Designs with Two Treatments, 693 

19.6 Optimal Designs, 695 

19.6.1 Information Matrices, 695 

19.6.2 Optimality Results, 697 

19.7 Analysis of Crossover Designs, 699 

19.8 Comments on Other Models, 706 

19.8.1 No Residual Effects, 706 

19.8.2 No Period Effects, 707 

19.8.3 Random Subject Effects, 707 


684 



xviii CONTENTS 

19.8.4 Autocorrelation Error Structure, 707 

19.8.5 Second-Order Residual Effects, 708 

19.8.6 Self and Mixed Carryover Effects, 710 

19.8.7 Factorial Treatment Structure, 713 

Appendix A Fields and Galois Fields 716 

Appendix B Finite Geometries 721 

Appendix C Orthogonal and Balanced Arrays 724 

Appendix D Selected Asymmetrical Balanced Factorial Designs 728 

Appendix E Exercises 736 

References 749 

Author Index 767 

Subject Index 771 



Preface 


The project of revising Kempthorne’s 1952 book Design and Analysis of Exper¬ 
iments started many years ago. Our desire was to not only make minor changes 
to what had become a very successful book but to update it and incorporate new 
developments in the field of experimental design. Our involvement in teaching 
this topic to graduate students led us soon to the decision to separate the book 
into two volumes, one for instruction at the MS level and one for instruction and 
reference at the more advanced level. 

Volume 1 (Hinkelmann and Kempthorne, 1994) appeared as an Introduction 
to Experimental Design. It lays the philosophical foundation and discusses the 
principles of experimental design, going back to the ground-breaking work of the 
founders of this field, R. A. Fisher and Frank Yates. At the basis of this devel¬ 
opment lies the randomization theory as advocated by Fisher and the further 
development of these ideas by Kempthorne in the form of derived linear mod¬ 
els. All the basic error control designs, such as completely randomized design, 
block designs, Latin square type designs, split-plot designs, and their associated 
analyses are discussed in this context. In doing so we draw a clear distinction 
among the three components of an experimental design: the error control design, 
the treatment design, and the sampling design. 

Volume 2 builds upon these foundations and provides more details about cer¬ 
tain aspects of error control and treatment designs and the connections between 
them. Much of the effort is concentrated on the construction of incomplete block 
designs for various types of treatment structures, including “ordinary” treatments, 
control and test treatments, and factorial treatments. This involves, by necessity, 
a certain amount of combinatorics and leads, almost automatically, to the notions 
of balancedness, partial balancedness, orthogonality, and uniformity. These, of 
course, are also generally desirable properties of experimental designs and aspects 
of their analysis. 

In our discussion of ideas and methods we always emphasize the historical 
developments of and reasons for the introduction of certain designs. The devel¬ 
opment of designs was often dictated by computational aspects of the ensuing 
analysis, and this, in turn, led to the properties mentioned above. Even though 
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in the age of powerful computers and the wide availability of statistical com¬ 
puter software these concerns no longer play the dominant feature, we remind 
the reader that such properties have general statistical appeal and often serve as 
starting points for new developments. Moreover, we caution the reader that not 
all software can be trusted all the time when it comes to the analysis of data 
from completely unstructured designs, apart from the fact that the interpretation 
of the results may become difficult and ambiguous. 

The development and introduction of new experimental designs in the last 50 
years or so has been quite staggering, brought about, in large part, by an ever- 
widening field of applications and also by the mathematical beauty and challenge 
that some of these designs present. Whereas many designs had their origin in 
agricultural field experiments, it is true now that these designs as well as modifica¬ 
tions, extensions, and new developments were initiated by applications in almost 
all types of experimental research, including industrial and clinical research. It 
is for this reason that books have been written with special applications in mind. 
We, on the other hand, have tried to keep the discussion in this book as general 
as possible, so that the reader can get the general picture and then apply the 
results in whatever area of application is desired. 

Because of the overwhelming amount of material available in the literature, we 
had to make selections of what to include in this book and what to omit. Many 
special designs or designs for special cases (parameters) have been presented 
in the literature. We have concentrated, generally speaking, on the more gen¬ 
eral developments and results, providing and discussing methods of constructing 
rather large classes of designs. Here we have built upon the topics discussed in 
Kempthome’s 1952 book and supplemented the material with more recent top¬ 
ics of theoretical and applications oriented interests. Overall, we have selected 
the material and chosen the depth of discussion of the various topics in order to 
achieve our objective for this book, namely to serve as a textbook at the advanced 
graduate level and as a reference book for workers in the field of experimen¬ 
tal design. The reader should have a solid foundation in and appreciation of 
the principles and fundamental notions of experimental design as discussed, for 
example, in Volume 1. We realize that the material presented here is more than 
can be covered in a one-semester course. Therefore, the instructor will have to 
make choices of the topics to be discussed. 

In Chapters 1 through 6 we discuss incomplete block and row-column designs 
at various degrees of specificity. In Chapter 1 we lay the general foundation for 
the notion and analysis of incomplete block designs. This chapter is essential 
because its concepts permeate through almost every chapter of the book, in 
particular the ideas of intra- and interblock analyses. Chapters 2 through 5 are 
devoted to balanced and partially balanced incomplete block designs, their spe¬ 
cial features and methods of construction. In Chapter 6 we present some other 
types of incomplete block designs, such as a-designs and control-test treatment 
comparison designs. Further, we discuss various forms of row-column designs 
as examples of the use of additional blocking factors. 



PREFACE 


XXI 


In Chapters 7 through 13 we give a general discussion of the most fundamental 
and important ideas of factorial designs, beginning with factors at two levels 
(Chapters 7 through 9), continuing with the case of factors with three levels 
(Chapter 10) through the general case of symmetrical and asymmetrical factorial 
designs (Chapters 11 and 12), and concluding with the important concept of 
fractional factorial designs (Chapter 13). In these chapters we return often to the 
notion of incomplete block designs as we discuss various systems of confounding 
of interaction effects with block effects. 

Additional topics involving factorial designs are taken up in Chapters 14 
through 17. In Chapter 14 we discuss the important concept of main effect plans 
and their construction. This notion is then extended to supersaturated designs 
(Chapter 15) and incorporated in the ideas of search designs (Chapter 16) and 
robust-design or Taguchi experiments (Chapter 17). We continue with an exten¬ 
sive chapter about lattice designs (Chapter 18), where the notions of factorial and 
incomplete block designs are combined in a unique way. We conclude the book 
with a chapter on crossover designs (Chapter 19) as an example where the ideas 
of optimal incomplete row-column designs are complemented by the notion of 
carryover effects. 

In making a selection of topics for teaching purposes the instructor should 
keep in mind that we consider Chapters 1, 7, 8, 10, and 13 to be essential for the 
understanding of much of the material in the book. This material should then be 
supplemented by selected parts from the remaining chapters, thus providing the 
student with a good understanding of the methods of constructing various types 
of designs, the properties of the designs, and the analyses of experiments based 
on these designs. The reader will notice that some topics are discussed in more 
depth and detail than others. This is due to our desire to give the student a solid 
foundation in what we consider to be fundamental concepts. 

In today’s computer-oriented environment there exist a number of software 
programs that help in the construction and analysis of designs. We have chosen 
to use the Statistical Analysis System (S AS) for these purposes and have provided 
throughout the book examples of input statements and output using various pro¬ 
cedures in SAS, both for constructing designs as well as analyzing data from 
experiments based on these designs. For the latter, we consider, throughout, 
various forms of the analysis of variance to be among the most important and 
informative tools. 

As we have mentioned earlier, Volume 2 is based on the concepts developed 
and described in Volume 1. Nevertheless, Volume 2 is essentially self-contained. 
We make occasional references to certain sections in Volume 1 in the form 
(I.xx.yy) simply to remind the reader about certain notions. We emphasize again 
that the entire development is framed within the context of randomization theory 
and its approximation by normal theory inference. It is with this fact in mind 
that we discuss some methods and ideas that are based on normal theory. 

There exist a number of books discussing the same types of topics that we 
exposit in this book, some dealing with only certain types of designs, but per¬ 
haps present more details than we do. For some details we refer to these books 
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in the text. A quite general, but less detailed discussion of various aspects of 
experimental design is provided by Cox and Reid (2000). 

Even though we have given careful attention to the selection of material for 
this book, we would be remiss if we did not mention that certain areas are 
completely missing. For example, the reader will not find any discussion of 
Bayesian experimental design. This is, in part, due to our philosophical attitude 
toward the Bayesian inferential approach (see Kempthome, 1984; Hinkelmann, 
2001). To explain, we strongly believe that design of experiment is a Bayesian 
experimentation process, not a Bayesian inference process, but one in which the 
experimenter approaches the experiment with some beliefs, to which he accom¬ 
modates the design. It is interesting to speculate whether precise mathematical 
formulation of informal Bayesian thinking will be of aid in design. Another area 
that is missing is that of sequential design. Here again, we strongly believe and 
encourage the view that most experimentation is sequential in an operational 
sense. Results from one, perhaps exploratory, experiment will often lead to fur¬ 
ther, perhaps confirmatory, experimentation. This may be done informally or 
more formally in the context of sequential probability ratio tests, which we do 
not discuss explicitly. Thus, the selection and emphases are to a certain extent 
subjective and reflect our own interests as we have taught over the years parts 
of the material to our graduate students. 

As mentioned above, the writing of this book has extended over many years. 
This has advantages and disadvantages. My (K.H.) greatest regret, however, is 
that the book was not completed before the death of my co-author, teacher, and 
mentor, Oscar Kempthome. I only hope that the final product would have met 
with his approval. 

This book could not have been completed without the help from others. First, 
we would like to thank our students at Virginia Tech, Iowa State University, and 
the University of Dortmund for their input and criticism after being exposed to 
some of the material. K.H. would like to thank the Departments of Statistics at 
Iowa State University and the University of Dortmund for inviting him to spend 
research leaves there and providing him with support and a congenial atmosphere 
to work. We are grateful to Michele Marini and Ayca Ozol-Godfrey for providing 
critical help with some computer work. Finally, we will never be able to fully 
express our gratitude to Linda Breeding for her excellent expert word-processing 
skills and her enormous patience in typing the manuscript, making changes after 
changes to satisfy our and the publisher’s needs. It was a monumental task and 
she did as well as anybody possibly could. 


Klaus Hinkelmann 


Blacksburg, VA 
May 2004 



CHAPTER 1 


General Incomplete Block Design 


1.1 INTRODUCTION AND EXAMPLES 

One of the basic principles in experimental design is that of reduction of experi¬ 
mental error. We have seen (see Chapters 1.9 and 1.10) that this can be achieved 
quite often through the device of blocking. This leads to designs such as ran¬ 
domized complete block designs (Section 1.9.2) or Latin square type designs 
(Chapter 1.10). A further reduction can sometimes be achieved by using blocks 
that contain fewer experimental units than there are treatments. 

The problem we shall be discussing then in this and the following chapters is 
the comparison of a number of treatments using blocks the size of which is less 
than the number of treatments. Designs of this type are called incomplete block 
designs (see Section 1.9.8). They can arise in various ways of which we shall 
give a few examples. 

In the case of field plot experiments, the size of the plot is usually, though 
by no means always, fairly well determined by experimental and agronomic 
techniques, and the experimenter usually aims toward a block size of less than 
12 plots. If this arbitrary rule is accepted, and we wish to compare 100 varieties 
or crosses of inbred lines, which is not an uncommon situation in agronomy, 
we will not be able to accommodate all the varieties in one block. Instead, we 
might use, for example 10 blocks of 10 plots with different arrangements for 
each replicate (see Chapter 18). 

Quite often a block and consequently its size are determined entirely on bio¬ 
logical or physical grounds, as, for example, a litter of mice, a pair of twins, 
an individual, or a car. In the case of a litter of mice it is reasonable to assume 
that animals from the same litter are more alike than animals from different lit¬ 
ters. The litter size is, of course, restricted and so is, therefore, the block size. 
Moreover, if one were to use female mice only for a certain investigation, the 
block size would be even more restricted, say to four or five animals. Hence, 
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Notice that with this arrangement every treatment is replicated four times, and 
every pair of treatments occurs together twice in the same block; for example, 
T\ and occur together in blocks 3 and 4. 

Many sociological and psychological studies have been done on twins because 
they are “alike” in many respects. If they constitute a block, then the block 
size is obviously two. A number of incomplete block designs are available 
for this type of situation, for example, Kempthome (1953) and Zoellner and 
Kempthorne (1954). 

Blocks of size two arise also in some medical studies, when a patient is 
considered to be a block and his eyes or ears or legs are the experimental units. 

With regard to a car being a block, this may occur if we wish to compare 
brands of tires, using the wheels as the experimental units. In this case one may 
also wish to take the effect of position of the wheels into account. This then 
leads to an incomplete design with two-way elimination of heterogeneity (see 
Chapters 6 and 1.10). 

These few examples should give the reader some idea why and how the need 
for incomplete block designs arises quite naturally in different types of research. 
For a given situation it will then be necessary to select the appropriate design 
from the catalogue of available designs. We shall discuss these different types 
of designs in more detail in the following chapters along with the appropriate 
analysis. 

Before doing so, however, it seems appropriate to trace the early history 
and development of incomplete block designs. This development has been a 
remarkable achievement, and the reader will undoubtedly realize throughout the 
next chapters that the concept of incomplete block designs is fundamental to the 
understanding of experimental design as it is known today. 


comparing more than this number of treatments would require some type of 
incomplete block design. 

Suppose we wish to compare seven treatments, 7\, T 2 , 乃， 75 , Tj, say, 
using female mice, and suppose we have several litters with four females. We 
then could use the following incomplete block design, which, as will be explained 
later, is a balanced incomplete block design: 

Animal 

Litter 12 3 4 


re^^re^rlre 

n7n5n2n3n3n4n5 

T T T T T T T 

7476rl72777374 

rlTaTyrl^TS^ 

12 3 4 5 6 7 
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The origins of incomplete block designs go back to Yates (1936a) who intro¬ 
duced the concept of balanced incomplete block designs and their analysis utiliz¬ 
ing both intra- and interblock information (Yates, 1940a). Other incomplete block 
designs were also proposed by Yates (1936b, 1937a, 1940b), who referred to these 
designs as quasi-factorial or lattice designs. Further contributions in the early his¬ 
tory of incomplete block designs were made by Bose (1939, 1942) and Fisher 
(1940) concerning the structure and construction of balanced incomplete block 
designs. The notion of balanced incomplete block design was generalized to that 
of partially balanced incomplete block designs by Bose and Nair (1939)，which 
encompass some of the lattice designs introduced earlier by Yates. Further exten¬ 
sions of the balanced incomplete block designs and lattice designs were made 
by Youden (1940) and Harshbarger (1947), respectively, by introducing balanced 
incomplete block designs for eliminating heterogeneity in two directions (gener¬ 
alizing the concept of the Latin square design) and rectangular lattices some of 
which are more general designs than partially balanced incomplete block designs. 
After this there has been a very rapid development in this area of experimental 
design, and we shall comment on many results more specifically in the following 
chapters. 


1.2 GENERAL REMARKS ON THE ANALYSIS OF INCOMPLETE 
BLOCK DESIGNS 

The analysis of incomplete block designs is different from the analysis of com¬ 
plete block designs in that comparisons among treatment effects and comparisons 
among block effects are no longer orthogonal to each other (see Section 1.7.3). 
This is referred to usually by simply saying that treatments and blocks are not 
orthogonal. This nonorthogonality leads to an analysis analogous to that of the 
two-way classification with unequal subclass numbers. However, this is only 
partly true and applies only to the analysis that has come to be known as the 
intrablock analysis. 

The name of the analysis is derived from the fact that contrasts in the treat¬ 
ment effects are estimated as linear combinations of comparisons of observations 
in the same block. In this way the block effects are eliminated and the estimates 
are functions of treatment effects and error (intrablock error) only. Coupled with 
the theory of least squares and the Gauss-Markov theorem (see 1.4.16.2)，this 
procedure will give rise to the best linear unbiased intrablock estimators for treat¬ 
ment comparisons. Historically, this has been the method first used for analyzing 
incomplete block designs (Yates, 1936a). We shall derive the intrablock analysis 
in Section 1.3. 

Based upon considerations of efficiency, Yates (1939) argued that the intra¬ 
block analysis ignores part of the information about treatment comparisons, 
namely that information contained in the comparison of block totals. This analysis 
has been called recovery of interblock information or interblock analysis. 
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Yates (1939 ， 1940a) showed for certain types of lattice designs and for the 
balanced incomplete block design how these two types of analyses can be com¬ 
bined to yield more efficient estimators of treatment comparisons. Nair (1944) 
extended these results to partially balanced incomplete block designs, and Rao 
(1947a) gave the analysis for any incomplete block design showing the similarity 
between the intrablock analysis and the combined intra- and interblock analysis. 

The intrablock analysis, as it is usually presented, is best understood by assum¬ 
ing that the block effects in the underlying linear model are fixed effects. But for 
the recovery of interblock information the block effects are then considered to 
be random effects. This leads sometimes to confusion with regard to the assump¬ 
tions in the combined analysis, although it should be clear from the previous 
remark that then the block effects have to be considered random effects for both 
the intra- and interblock analysis. To emphasize it again, we can talk about intra¬ 
block analysis under the assumption of either fixed or random block effects. In 
the first case ordinary least squares (OLS) will lead to best linear unbiased esti¬ 
mators for treatment contrasts. This will, at least theoretically, not be true in the 
second case, which is the reason for considering the interblock information in 
the first place and using the Aitken equation (see 1.4.16.2)，which is also referred 
to as generalized (weighted) least squares. 

We shall now derive the intrablock analysis (Section 1.3), the interblock 
analysis (Section 1.7)，and the combined analysis (Section 1.8) for the general 
incomplete block design. Special cases will then be considered in the following 
chapters. 


1.3 THE INTRABLOCK ANALYSIS 
1.3.1 Notation and Model 

Suppose we have t treatments replicated r\, ... , r r times, respectively, and 

b blocks with k\ ， k:, … ， kb units, respectively. We then have 

t b 

Y/ i = H k j = n 

i=l )=1 

where n is the total number of observations. 

Following the derivation of a linear model for observations from a random¬ 
ized complete block design (RCBD), using the assumption of additivity in the 
broad sense (see Sections 1.9.2.2 and 1.9.2.6), an appropriate linear model for 
observations from an incomplete block design is 

yijt = M + T/ + Pj + eiji (1.1) 

(i = 1,2,... j = 1,2,... ,b; € = 0, 1,..., riij), where r/ is the effect of the 

ith treatment, the effect of the jth block, and eiji the error associated with the 
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observation As usual, the eyi contain both experimental and observational 
(sampling) error, that is, using notation established in Volume 1, 

^iji = ^ijt + rjiji 

with etji representing experimental error and rjij^ representing observational 
error. Also, based on previous derivations (see 1.6.3.4), we can treat the etjt 
as i.i.d. random variables with mean zero and variance + Note that 

because riij, the elements of the incidence matrix N, may be zero, not all treat¬ 
ments occur in each block which is, of course, the definition of an incomplete 
block design. 

Model (1.1) can also be written in matrix notation as 

y = + X t t + X + e (1.2) 

where 3 is a column vector consisting of n unity elements, Xp is the observation- 
block incidence matrix 

~0 kl " 

0k 2 

= •• 

_ 

with 3kj denoting a column vector of kj unity elements (j = 1,2,... ,b) and 

X r = (XuX 2 ,...,X t ) 

is the observation-treatment incidence matrix, where Xi is a column vector with 
ri unity elements and (n — r/) zero elements such that x-x/ = r/ and = 0 
for i ^ /’(/ ， i' = 1 , 2 ,..., t). 


1.3.2 Normal and Reduced Normal Equations 

The normal equations (NE) for /x, Ti, and Pj are then 


t b 

r(Ti + y^kj^j = G 
7=1 
b 

+ 〉: j ^j = (i = 1 , 2 ,...,?) 

y=i 

t 

kjll + ^ njfZj + kjpj = Bj (j = 1,2,… ， b) 


(1.3) 
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where 

^ yiji = ith treatment total 

j 么 

Bj = ^ yiji = jth block total 

u 

i 


Ti = ^ Bj = overall total 


Equations (1.3) can be written in matrix notation as 


0' n X r J' n XA 





X' x 3n X' x X x X' r X p 



= 

Ky 

X^X r 




K x ， ^y) 


which, using the properties of 0, X T , X^, can be written as 


-0 ； ,0n 3 ； 7? y b K- 




~G~ 

R3, R N 



= 

T 

.K3 b N' K _ 






where 

R = diag (ri) t x t 

K = diag (kj) bxb 

N = (riij) t x b (the incidence matrix) 

= (T U T 2 , … ， T t ) 

B r = (B\, B2, …， Btd 

T = (Tj , T2, . . . ， TV) 

P ’ = (01， #2, ... ，你） 


(1.4) 


(1.5) 


and the 3’s are column vectors of unity elements with dimensions indicated by 
the subscripts. From the third set of equations in (1.5) we obtain 


^ b +J = K-\B-N r r) 


(1.6) 
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Substituting (1.6) into the second set of (1.5), which can also be expressed 
as N0b[ji + NP + Kr = T (since N3b = R0 t ), leads to the reduced normal 
equations (RNE) (see Section 1.4.7.1) for r 

(R - =T - NK X B (1.7) 


Standard notation for (1.7) is 


Cr= Q 


where 


C = R- NK_ l N' 


and 


Q = T - NK l B 


the (/, i f ) element of C being 


Cii 


8ii，n - 二 . 


(l.B) 

(1.9) 

( 1 . 10 ) 


with Sa' — 1 for i = i' and = 0 otherwise, and the «th element of Q being 

7=1 J 

And Qi is called the ith adjusted treatment total ， the adjustment being due to 
the fact that the treatments do not occur the same number of times in the blocks. 


1.3.3 The C Matrix and Estimable Functions 

We note that the matrix C of (1.9) is determined entirely by the specific design, 
that is, by the incidence matrix N. It is, therefore, referred to as the C matrix 
(sometimes also as the information matrix) of that design. The C matrix is 
symmetric, and the elements in any row or any column of C add to zero, that 
is, CO = 0, which implies that r(C) = rank(C) < ^ — 1. Therefore, C does not 
have an inverse and hence (1.8) cannot be solved uniquely. Instead we write a 
solution to (1.8) as 

r = CQ (1.11) 

where C~ is a generalized inverse for C (see Section 1.3.4). 
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If we write C = (ci, C 2 ,..., c t ), where c/ is the /th column of C, then the 
set of linear functions 

{^i T, / = 1,2, . . . , t} 

which span the totality of estimable functions of the treatment effects, has dimen¬ 
sionality r(C). Let c f r be an estimable function and c'^x its estimator, with T 
from (1.11). Then 

E(c f r) = E {c f C~ Q) 

= c f C~E(Q) 

= cCCr 

For c'^x to be an unbiased estimator for c'x for any r, we then must have 

cCC = c f (1.12) 

Since CO = 0, it follows from (1.12) that c f 3 = 0. Hence, only treatment con¬ 
trasts are estimable. If r(C) = t — l, then all treatment contrasts are estimable. 
In particular, all differences r/ — (/ ^ i f ) are estimable, there being t — l lin¬ 

early independent estimable functions of this type. Then the design is called a 
connected design (see also Section 1.4.13.3). 

1.3.4 Solving the Reduced Normal Equations 

In what follows we shall assume that the design is connected; that is, r(C)= 
t — l. This means that C has t — \ nonzero (positive) eigenvalues and one zero 
eigenvalue. From 



" 1 " 


" 1 " 


1 


1 

c 

1 

= 0 = 0 

1 


it follows then that (1,1,..., \) f is an eigenvector corresponding to the zero 
eigenvalue. If we denote the nonzero eigenvalues of C by du dz, … ， d t -[ 
and the corresponding eigenvectors by with - = 1 (i = 

1,2,..., t — l) and •/ = 0(/ # /’)，then we can write C in its spectral decom¬ 
position as 

t-i 

C = 


( 1 . 13 ) 
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or with d t = 0 and ^ 1 , ..., 1 ), alternatively as 

t 

(1-14) 

i=\ 

We note that = 1 and d = 0 for / = 1 ， 2, ...，，一 1. 

We now return to (1.8) and consider a solution to these equations of the 
form given by (1.11). Although there are many methods of finding generalized 
inverses, we shall consider here one particular method, which is most useful 
in connection with incomplete block designs, especially balanced and partially 
balanced incomplete block designs (see following chapters). This method is based 
on the following theorem, which is essentially due to Shah (1959). 

Theorem 1.1 Let C bt sl t x t matrix as given by (1.9) with r(C) = t — l. 
〜 / 〜一 l 

Then C = C + a30 , where a # 0 is a real number, admits an inverse C , and 
〜一 l 

C is a generalized inverse for C. 

Proof 

(a) We can rewrite C as 

T 

〜 1 

C = C + aOO' = C + a . ( 1 , 1 , 1 ) = C + at^X 


and because of (1.13) 

t-i 

C = (1.15) 

/ = 1 

Clearly, C has nonzero roots d\, d:, …， d t 一 \， d t = at and hence is non¬ 
singular. Then 

^~ l + (116) 
i=l 

(b) To show that C 1 = C~ we consider CC 1 C. From (1.13), (1.15), and 
(1.16) we have 
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and 

t— 1 1 

CC l = = = I - (1.17) 

i=l 

cc~ l c = c 

which implies 



We remark here already that determining C 一 for the designs in the following 

chapters will be based on (1.17) rather than on (1.14). 

〜一 l 

Substituting C into (1.13) then yields a solution of the RNE (1.8); that is, 

r = C _1 e (1.18) 

We note that because of (1.8) and (1.16) 

E(r) = E(c~ 1 Q) 


= C~ l E(Q) 

=C _1 £(CT) 
〜一 1 

=C Cr 



with T = l/t • ti ; that is, E(r) is the same as if we had obtained a generalized 
inverse of C by imposing the condition zj = 0. □ 

1.3.5 Estimable Functions of Treatment Effects 

We know from the Gauss-Markov theorem (see Section 1.4.16.2) that for any 
linear estimable function of the treatment effects, say c f r, 

E(c r r) = c f r (1.19) 

is independent of the solution to the NE (see Section I.4.4.4). We have further 

var(c /1 r) = c r C 1 ca^ 


( 1 . 20 ) 
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with a corresponding result (but same numerical value) for any other solution 
obtained, using an available software package (see Section 1.14). We shall elab¬ 
orate on this point briefly. 

Let us rewrite model (1.2) as 

y = + X + X T r + e 

=(0 Xp X r ) 0 + e 
=X0 + e 

with 

X = (0 :X^: X r ) 
and 

0’ = O,T ’） 

The NE for model (1.21) are 

XX&* = X'y 

A solution to (1.23) is given by, say, 

0* = {X f X)~X f y 

for some {X r X)~ . Now {X r X)~ is a (l b 1) x (l b 1) matrix that we 

can partition conformably, using the form of X as given in (1.22), as 

(A 叫 A 邱 

(X f X)- = A^r (1.24) 

\Kr A， Pr A tt J 

Here, A rr is ^ t x t matrix that serves as the variance-covariance matrix for 
obtaining 

var(c’r*) = c' A rr ca 》 (1.25) 

For any estimable function c’t we have c f1 t = c’t* and also the numerical 

. a—1 

values for (1.20) and (1.25) are the same. If we denote the (/, i ) element of C 
by c 11 and the corresponding element of A rr in (1.24) by a 11 , then we have, for 
example, for c’r = 一 r" 

var(r ； -x v ) = (c"' - 2c"' + a] = (a 1 ' 1 ' - 2a n ' + 〆")(1.26) 

For a numerical example and illustration of computational aspects, see 
Section 1.13. 


( 1 - 21 ) 

( 1 - 22 ) 

(1.23) 
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1.3.6 Analyses of Variance 

It follows from general principles (see Section 1.4.7.1) that the two forms of 
analysis of variance are as given in Tables 1.1 and 1.2. We shall henceforth refer 
to the analysis of variance in Table 1.1 as the treatment-after-block ANOVA or 
TI B-ANOVA as it is associated with the ordered model 

y = [m3 + X + X r x + e 

whereas the analysis of variance in Table 1.2 is associated with the ordered model 

j = ^3 + X T r + + e 

and hence shall be referred to as the block-after-treatment ANOVA or B | T- 
ANOVA. To indicate precisely the sources of variation and the associated sums 
of squares, we use the notation developed in Section 1.4.7.2 for the general case 
as it applies to the special case of the linear model for the incomplete block 


Table 1.1 T|B-ANOYA for Incomplete Block Design 


Source 

d.f. a 

SS 


E(MS) 

Xp\0 

b- 1 

f 

h 

G 2 

n 


X Z \3,X^ 

t- 1 

/ =i 


0 t'Ct 

^ + ? -i 

I\0,X^X T 

n — b — t \ 

Difference 



Total 

n — \ 

ijl 

G 2 

n 



a d.f. = degrees of freedom. 


Table 1.2 B|T-ANOYA for Incomplete Block Design 


Source 

d.f. 

SS 


t - 1 

Tf G 2 

Xp\o,x T 

b- 1 

Difference 

i\o,x^,x T 

n — b — t l 

From Table 1.1 

Total 

n — l 
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design, thereby avoiding the commonly used but not always clearly understood 
terms blocks ignoring treatments for (X^ \ 0), treatments eliminating blocks for 
(X r I 3, X^), and blocks eliminating treatments for (X^ \ 3, X r ). 

The TI B-ANOVA follows naturally from the development of the RNE for the 
treatment effects. It is the appropriate ANOVA for the intrablock analysis as it 
allows to test the hypothesis 

Hq: T\=Z2 = • • • = r t 

by means of the (approximate) F test (see 1.9.2.5) 

SS(X T lO,X^)/(t-l) 

~ SS(HO,X^,X r )/(n-b-t + l) ' j 

Also MS(Error)= SS(/ | J, X T )/{n — b — t l) is an estimator for to 
be used for estimating var(c’T) of (1.20). 

The usefulness of the B | T-ANOVA in Table 1.2 will become apparent when 
we discuss specific aspects of the combined intra- and interblock analysis in 
Section 1.10. At this point we just mention that SS(X^ |, 3, X T ) could have been 
obtained from the RNE for block effects. Computationally, however, it is more 
convenient to use the fact that SS(/ | 0, X x ) = SS(7 | 3, X T , X^) and then 
obtain SS(X^ \ 3, X r ) by subtraction. 

Details of computational procedures using SAS PROC GLM and SAS PROC 
Mixed (5A5 1 1999-2000) will be described in Section 1.14. 

1.4 INCOMPLETE DESIGNS WITH VARIABLE BLOCK SIZE 

In the previous section we discussed the intrablock analysis of the general incom¬ 
plete block design; that is, a design with possibly variable block size and possibly 
variable number of replications. Although most designed experiments use blocks 
of equal size, k say, there exist, however, experimental situations where blocks of 
unequal size arise quite naturally. We shall distinguish between two reasons why 
this can happen and why caution may have to be exercised before the analysis 
as outlined in the previous section can be used: 

1. As pointed out by Pearce (1964, p. 699): 

With much biological material there are natural units that can be used as blocks and 
they contain plots to a number not under the control of the experimenter. Thus, the 
number of animals in a litter or the number of blossoms in a truss probably vary 
only within close limits. 

2 . Although an experiment may have been set up using a proper design, that 
is, a design with equal block size, missing plots due to accidents during 
the course of investigation will leave one for purpose of analysis with a 
design of variable block size. 
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In both cases there are two alternatives to handle the situation. In case 1 one 
may wish to reduce all blocks to a constant size, thereby reducing the number of 
experimental units available. If experimental units are at a premium, this may not 
be the most desirable course of action. The other alternative is to use the natural 
blocks and then use the analysis as given in the previous section. Before doing 
so we mention that its validity will depend on one very important assumption, 
and that is the constancy of the variance for all blocks. In general, the size 
of will depend on the size of the blocks: The larger the blocks, the larger 
will be since it is in part a measure of the variability of the experimental units 
within blocks (see 1.9.2.4). In fact, this is the reason for reducing the block size 
since it may also reduce the experimental error. Experience shows that such a 
reduction in is not appreciable for only modest reduction in block size. It is 
therefore quite reasonable to assume that is constant for blocks of different 
size if the number of experimental units varies only slightly. 

In case 2 one possibility is to estimate the missing values and then use the 
analysis for the proper design. Such a procedure, however, would only be approxi¬ 
mate. The exact analysis then would require the analysis with variable block size 
as in case 1. Obviously, the assumption of constancy of experimental error is 
satisfied here if is was satisfied for the original proper design. 


1.5 DISCONNECTED INCOMPLETE BLOCK DESIGNS 

In deriving the intrablock analysis of an incomplete block design in Section 1.3.4 
we have made the assumption that the C matrix of (1.9) has maximal rank t — 1, 
that is, the corresponding design is a connected design. Although connectedness 
is a desirable property of a design and although most designs have this property, 
we shall encounter designs (see Chapter 8) that are constructed on purpose as 
disconnected designs. We shall therefore comment briefly on this class of designs. 

Following Bose (1947a) a treatment and a block are said to be associated if 
the treatment is contained in that block. Two treatments are said to be connected 
if it is possible to pass from one to the other by means of a chain consisting alter¬ 
nately of treatments and blocks such that any two adjacent members of the chain 
are associated. If this holds true for any two treatments, then the design is said to 
be connected, otherwise it is said to be disconnected (see Section 1.4.13.3 for a 
more formal definition and Srivastava and Anderson, 1970). Whether a design is 
connected or disconnected can be checked easily by applying the definition given 
above to the incidence matrix N\ If one can connect two nonzero elements of 
N by means of vertical and horizontal lines such that the vertices are at nonzero 
elements, then the two treatments are connected. In order to check whether a 
design is connected, it is sufficient to check whether a given treatment is con¬ 
nected to all the other ^ — 1 treatments. If a design is disconnected, it follows 
then that (possibly after suitable relabeling of the treatments) the matrix NN r 
and hence C consist of disjoint block diagonal matrices such that the treatments 
associated with one of these submatrices are connected with each other. 
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Suppose C has m submatrices, that is, 

"Ci 

C 2 

c = 


where C v is t v x t v (Y^=i ^ = f). It then follows that rank (C v ) = t v — l(v = 
1, 2,..., m) and hence rank(C) = t — m. The RNE is still of the form (1.8) 

一 〜 一 l 

with a solution given by (1.11)，where in C 一 = C we now have, modifying 
Theorem 1.1, 







Table 1.3 T|B-ANOVA for Disconnected Incomplete Block Design 


Source 

d.f. 

SS 


b- 1 

j k ^ n 

x T \o,x^ 

t — m 

E 祕 

I\0,X^X T 

n — b — t + m 

Difference 

Total 

n — \ 

ijH 

Table 1.4 B|T-ANOVA for Disconnected Incomplete Block Design 

Source 

d.f. 

SS 

x T \o 

t- 1 

y T l^ 

/ r ' n 

x^\0,x T 

b — m 

Difference 

I\0,X T ,X^ 

n — t — b m 

From Table 1.3 

Total 

n — 1 




16 GENERAL INCOMPLETE BLOCK DESIGN 

with a v (y = 1 ， 2, ... ， m) arbitrary constants (#0) and the matrices are of 
appropriate dimensions. Following the development in Section 1.3.6，this then 
leads to the ANOVA tables as given in Tables 1.3 and 1.4. 


1.6 RANDOMIZATION ANALYSIS 

So far we have derived the analysis of data from incomplete block designs using 
a Gauss-Markov linear model as specified in (1.1). We have justified the appro¬ 
priate use of such an infinite population theory model in our earlier discussions 
of error control designs (see, e.g.，Sections 1.6.3 and 1.9.2) as a substitute for 
a derived, that is, finite, population theory model that takes aspects of random¬ 
ization into account. In this section we shall describe in mathematical terms the 
randomization procedure for an incomplete block design, derive an appropriate 
linear model, and apply it to the analysis of variance. This will show again, as 
we have argued in Section 1.9.2 for the RCBD, that treatment effects and block 
effects cannot be considered symmetrically for purposes of statistical inference. 


1.6.1 Derived Linear Model 

Following Folks and Kempthome (1960) we shall confine ourselves to proper 
(i.e., all kj = k )，equireplicate (i.e., all ri = r) designs. The general situation is 
then as follows: We are given a set of b blocks, each of constant size k ; a master 
plan specifies b sets of k treatments; these sets are assigned at random to the 
blocks; in each block the treatments are assigned at random to the experimental 
units (EU). This randomization procedure is described more formally by the 
following design random variables: 



if the wth set is assigned to the jth block 
otherwise 


and 



if the uv treatment is assigned to the 
Ith unit of the yth block 
otherwise 


(1.28) 


(1.29) 


The uv treatment is one of the t treatments that, for a given design, has been 
assigned to the uth set. 

Assuming additivity in the strict sense (see Section 1.6.3), the conceptual 
response of the uv treatment assigned to the £th EU in the jth block can be 
written as 

Tjiuv = Ujt~\~T uv ( 1 . 30 ) 
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where Uji is the contribution from the £th EU in the jth block and T uv is the 
contribution from treatment uv. We then write further 

Tjluv = u.. + (Uj.-U..) + {Uji - Uj.) + T.. + (T uv - T .) 

—b j -\- Tiiy H - u ji (1.31) 

where 

/x = f/.. + 7\. is the overall mean 

bj = Uj. — U„ is the effect of the jth block 
(j = 1 , 2,. .., b) 

z uv = t uv — T.. is the effect of the uv treatment 
(u = 1,2,..., b; v = 1,2,... ,k) 

Uj£ = Uji — U j, is the unit error 

(£ = 1 , 2 ,...,/:) 

with bj = 0 = t uv = Uj[ We then express the observed response 
for the uv treatment, y uv , as 

yuv = ^ ^ al j Tji uv 

j ^ 

=M + ^1 ； + XI ali j b j + H 

j j ^ 

=/X -j - T U y ~|- ~\~ OJ U y (1.32) 

where 

a u j bj (1.33) 

j 

is a random variable with 

E(M = o E(Pl) = I J2 b2 j E ^<M = b{b l _ X) Y, b ) ( " M ') 

j j 

Also, 

o^uv = Uji (1.34) 
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is a random variable with 

E(o) uv ) = 0 E(co^ v ) = — 〉: 

E(co uv o) uv ’)=- 队认 -V) ^ ^ … 

E(OJuv ⑴ u’v f 、 = 0 (w ^ M ) 

In deriving the properties of the random variables p u and co uv we have used, of 
course, the familiar distributional properties of the design random variables a u - 
and Sj^ , such as 



= 

=i)= 

b 



=1 laj, = 

=i)= 

0 

(j ^ /) 


=1 laj, = 

=i)= 

1 

(u ^ u\ j ^ /) 

b{b — 1) 


P(S^ = 

=i)= 

1 

k 


)( 哎 = 

:1) 1 (% = 

=i)= 

0 

a 关 e，) 

， ( 哎 = 

1) 1 (^；= 

=i)= 

1 

d v') 

k{k — 1) 

= 

i) i («= 

=i)= 

1 

¥ 

(j 羊 f'u 羊 U') 


and so on. 

1.6.2 Randomization Analysis of ANOVA Tables 

Using model (1.32) and its distributional properties as induced by the design 
random variables a 1 - and Sj^, we shall now derive expected values of the sums 
of squares in the analyses of variance as given in Tables 1.1 and 1.2: 

1. E(SS Total) = E E (y U v - y..) 2 

uv 

= E 〉 : {t uv + 凡 + oj uv )^ 

uv 

=X) r uv-^ k ^2 b2 j -^-^2 

⑽ j jt 



RANDOMIZATION ANALYSIS 


19 


2 . Eissix^o)] 

uv 

=k E (T m . + oj uv ) 2 

U V 

= k Yji + k J2 b2 j 

u j 

yi _ t _ b ~\~ \ x 一 > o 

3. £[SS(/|J,^,X r )]= 

since the incomplete block designs considered are unbiased. 

4. E[SS(X r I 0, X^)] can be obtained by subtraction. 

5. To obtain E[SS(X Z \ 3)] let 



if the wth treatment corresponds to the 
uv index (w = 1,2,... ,t) 
otherwise 



if the wth treatment occurs in the uth block 
otherwise 


where 



and 


Eru= r 


Then 


E[SS(X r \0)] = E 


E E^-tEE^- 


w \ uv 


E 


^2 [ rTw + ^ a + 


) ⑴ uv 


E t u, + -E e EruH +-E e Ey.^> 


COu 


w \ u 


w \ uv 
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=r 


Yl r ^ + ~H E 




w^u’ 




'Eru^l + E E yZyZ, 

v u vv r 

u^v r 

Yuvyu'v' a> uv^>u , y f 

— uu r vv' 
u^u r 


^uv^uv' 


Now 


E 


TXf 


E 


E yuY^uPw 


u 參 u’ 


EH 




uu 

u 參 u’ 


_ ▽ Yu( r -，、一-— ▽ 




rir - 1) 
bib - 1 ) 乙 


bib - 1 ) 




〜 2 


E 


E 




E 


E 


E 


\ 、 \ 、 ..w ..w ,.w.. 

/ , / , yuvy U v r co uv a> uv , 
u lV 
v^v' 

yuvYu ： v' ^uvO^u'v' 

uu r 
u 寺 u’ 


bk ^ Uul 


o since YuvYuv 


and hence 




bk 




Thus, we have for the mean squares (MS) from Tables 1.1 and 1.2 the expected 
values under randomization theory as given in Tables 1.5 and 1.6, respectively. 
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Table 1.5 E(MS) for T|B-ANOVA 

Source £'(MS) 

u j 

m 心 ^rryE^ + (n 似 .)/(， - 1) 

v ’ jt \ uv u !' 


Table 1.6 E(MS) for B|T-ANOYA 


Source 

E(MS) 

x T \^ 

t v > 9 . t (b _ Y ) v > 9 V \ > 2 

bk(t - 1) ^ U Jt b(b - 1)0 - 1) ^ 3 t-l^ Tw 

X^\3,X x 

t 一 k k 、 2 bk 一 t k y 2 

b(b- l)k(k - 1) ^ UJt + (b- l) 2 -y b] 

I\0,X^X T 

1 y u 2 u 

b{k-D^ ji 


If we define 

1 2 2 

b{k - 1 ) ^ Ujl = Gu 

v ’ ji 

and 

j 

we can then express the expected values for the three important mean squares in 
ANOVA Tables 1.5 and 1.6 as 

E [MS(X t I 3, Xp)] = al + ^~ ,uv Tuv ~ k Zu - (1.35) 

t _ 1 

E [MS(X p \0,X r )]^ + (1.36) 


£_(/|3，々，1)]=4 


(1.37) 
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We make the following observations: 

1. The quadratic form in the r uv in (1.35) is just a different way of writing t’Ct 
in Table 1.1. Both expressions indicate that the quadratic form depends on the 
particular design chosen, and both equal zero when all the treatment effects 
are the same. 

2. It follows from (1.35) and (1.37) that, based on the equality of the 五 (MS) 
under Ho : n = t 2 = • • • = r ? , the ratio 

MS(X r \3, X^)/MS(I\3,X^X r ) (1.38) 

provides a test criterion for testing the above hypothesis. In fact, Ogawa 
(1974) has shown that the asymptotic randomization distribution of (1.38) is 
an F distribution with t — l and n — t — b l d.f. We interpret this again 
to mean that the usual F test is an approximation to the randomization test 
based on (1.38). 

3. Considering (1.36) and (1.37), there does not exist an exact test for testing 
the equality of block effects. This is in agreement with our discussion in 
Section 1.9.2 concerning the asymmetry of treatment and block effects. 

4. For k = t and r = b, that is, for the RCBD, the results of Tables 1.5 and 1.6 
agree with those in Table 9.1 of Section 1.9.2. 

5. With treatment-unit additivity in the broad sense (see Section 1.6.3.3) the 

expressions in (1.35), (1.36)，and (1.37) are changed by adding cr^ + to 
the right-hand sides (recall that = cr^ = cr^). Remarks 

(2) and (3) above remain unchanged. 

6. For the recovery of interblock information (to be discussed in Section 1.7 )， 
we need to estimate a, (or a function of a 彦 ). Clearly, under the assumption 
of additivity in the broad sense, this cannot be done considering that 

E[MS(X^^X r )]^al + a} 1+ ^^ k al + ^al 
It is for this reason only that we shall resort to the approximation 

E[MS (XpU ， X T )]^a^ + ^ + ^ aj + ^ aj 

' (1-39) 

which is the expected value based on an infinite population theory model 
[see (1.49) and (1.50)]. 

For a different approach to randomization analysis, see Calinski and Kageyama 

(2000). ^ … 
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1.7 INTERBLOCK INFORMATION IN AN INCOMPLETE 
BLOCK DESIGN 

1.7.1 Introduction and Rationale 

As mentioned earlier, Yates (1939, 1940a) has argued that for incomplete block 
designs comparisons among block totals (or averages) contain some information 
about treatment comparisons, and he referred to this as recovery of interblock 
information. The basic idea is as follows. 

Consider, for purposes of illustration, the following two blocks and their obser¬ 
vations from some design: 



Block 1: 

J51, 

y3u 

yn 


Block 2: 

yn, 

J42, 

J32 

Let 

= 

J51 +J31 +J11 


and 

B 2 = 

J 22 + y42 + y32 



represent the block totals. Using model (1.1) we can write 

B\ — B2 = (T5 + T2 + ti) — (t2 + T4 + T3) 

+ 3/3i — 3 此 + (es\ + ^31 + e\\) 

— (^22 + ^42 + ^ 32 ) 

Assuming now that the are random effects with mean zero, we find 
E(B\ - B 2 ) = Ti + T5 - T2 - T4 

It is in this sense that block comparisons contain information about treatment 
comparisons. We shall now formalize this procedure. 

1.7.2 Interblock Normal Equations 

Consider the model equation (1.1) 

y = + X t t + X0 e 


where P is now assumed to be a random vector with E(P) = 0 and var(jS)= 
crj I. As pointed out above the interblock analysis is based on block totals rather 
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than on individual observations, that is, we now consider 



x' p y = 

h 

fi + N f r + Kfi + X r p e 

(1.40) 



WJ 

1 



We then have 

< 众 1 、 

£(Z^)= k ： 

\x + N't 

(1.41) 


W 


and the variance-covariance matrix under what we might call a double error 
structure with both P and e being random vectors 

var(Z^) = K 2 aj + Ka] 


K 


a _£ 


K 


(say) 


and 


L = diagf^y} = diag \ kj 


\ + i 


^7 I = dia § k i 


w 




diag{kjpj] 


with 


w 


(1.42) 


and 


w ： 


； + k i a j 


(1-43) 


and 


Pj 


w 




+ k J a l 


(1.44) 


The quantities w and w f j of (1.42) and (1.43) are referred to as intrablock and 
interblock weights, respectively, as w is the reciprocal of the intrablock variance, 
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and w r - is the reciprocal of the interblock variance, that is, var(B ; ) on a per 
observation basis, or \a.r(Bj/kj) = cr ^ kjcr^. We then use as our “observation” 

z = L-(1.45) 


vector 


which has 


var(z) = laj 


and hence satisfies the Gauss-Markov conditions. 
If we write (1.41) as 


E(X^y) = (kN f ) 




with k = (k\ ， k 2 , … ,kb ) f ， we have from (1.45) that 


E(z) = L~ l/2 (kN , ) 




The resulting NE, which we shall refer to as the interblock NE, is then given by 




L~\k N') 






L- l X' p y 


(1.46) 


or explicitly as 




k ± 






E 


^j_ ▽ n ^j n tj 




j J 

n 'j n tj 










(1.47) 


It can be seen easily that the rank of the coefficient matrix in (1.47) is t. To solve 
the interblock NE, we take /x* = 0 and hence reduce the set to the following t 
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equations in r*, r|,..., r* where we have used the fact that tj = kj pj : 


孓 〜 j 


^ pj 1 

/ K J 

^ pj 1 

^4 - 

J 

^ pJ 1 

■- — 

^ pj 1 

1 > 抑吝 . 

2 P j 1 

■- ^T~ 




▽ pj 1 

j 7 

^ pj 1 
H— n 2 J B j 
j 7 

.j J 


The solution to the equations (1.48) is referred to as the interblock information 
about the treatment effects, with 


E(jf) = / 1 -\~ n const.. y^(/x + Tf) 


Hence 



We note here that typically (see Kempthome, 1952) the interblock analysis 
is derived not in terms of the “observations” z [as given in (1.45)] but rather 
in terms of the block totals X^y. The resulting NE are then simply obtained 
by using L = / in (1.46) and subsequent equations. The reason why we prefer 
our description is the fact that then the intra- and interblock information can be 
combined additively to obtain the so-called combined analysis (see Section 1.8) 
rather than in the form of a weighted average (see Kempthome, 1952). 
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1.7.3 Nonavailability of Interblock Information 

We conclude this section with the following obvious remarks: 

1. For the special case kj = t for all j, r[ = b for all i and all ntj = 1, we 
have, of course, the RCBD. Then the elements in the coefficient matrix of 
(1.48) are all identical, and so are the right-hand sides. Consequently, (1.48) 
reduces to a single equation 

b H 和 

i j 

and no contrasts among the r/ are estimable. Expressed differently, any con¬ 
trast among block totals estimates zero, that is, no interblock information is 
available. 

2. For a design with b < t (and such incomplete block designs exist as we shall 
see in Chapter 4; see also the example in Section 1.7.1), the rank of the 
coefficient matrix of (1.48), NL~ l N\ is less than t. Hence not all \x + r/ 
are estimable using block totals, which means that interblock information is 
not available for all treatment contrasts. 


1.8 COMBINED INTRA- AND INTERBLOCK ANALYSIS 

1.8.1 Combining Intra- and Interblock Information 

The two types of information about treatment effects that we derived in Sections 

1.3.2 and 1.7.2 can be combined to yield the “best” information about estimable 
functions of treatment effects. All we need to do is to add the coefficient matrices 
from the intrablock RNE (1.8) and the interblock NE (1.48) and do the same 
for the corresponding right-hand sides. This will lead to a system of equations 
in r**, T^*, … ， rf*, say, and the solution to these equations will lead to the 
combined intra- and interblock estimators for treatment contrasts. 

In the following section we shall derive the equations mentioned above more 
directly using the method of generalized least squares, that is, by using the Aitken 
equations described in Section 1.4.16.2 

1.8.2 Linear Model 

In order to exhibit the double error structure that characterizes the underlying 
assumptions for the combined analysis, we rewrite model (1.1) as 

yji ^ + + Pj +eji (1.49) 

where j = 1 ， 2,... ， b.，I = 1,2,kj ； Xjt denotes the effect of the treatment 
applied to the Ith experimental unit in the yth block, the Pj are assumed to 
be i.i.d. random variables with E(^j) = 0, var(/3 y ) = and the ej£ are i.i.d. 
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random variables with E{ejt) = 0, var(^^) = o^. We then have 

^ji) = ~h 

and 

# + f or J. = /’ ( - = ^' 


(1.50) 


cov (yji, yjfc) 


o} for j = f，t 參 if 

0 otherwise 


(1.51) 


To use matrix notation it is useful to arrange the observations according to blocks, 
that is, write the observation vector as 


y = (^11^12,... ， : yih ， ;y2i ， 722 , . • • ， yb\^yb2, •.. ， ybk b ) r 


Letting 


X = (0 X r ) 


we rewrite (1.50) as 


E(y) = X 


'/X、 


(1.52) 


and the variance - covariance (1.51) as 

>1 

I 

var(j) 


V 2 0 

0 v b 


^2 _ t /^2 


(1.53) 


where V j is given by 


V J = Ik i + 3k i d 'kj 


(1.54) 


1.8.3 Normal Equations 

Applying now the principles of least squares to the model (1.52) with covariance 
structure (1.53) yields the Aitken equations (see Section 1.4.16): 


(X f V~ l X) ^ = X f V~ x y 


(1.55) 


where 
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(1.56) 


(1.57) 


_ (i 

x'v~ l = x f - 

and 

x'v~ l x 

=IX 



-P 尔 2 … 



<52^12^ 2 … 

hn\b^ kb 


^t2^ f k 2 … 

^b n tb^k b 

IX 

IX . 

… IX 

r i-J2 s j n v 

-jyj n ” n 2j . 

■- -J2 s J nl j nt j 

-J2 s j ni j n 'j 

-^2 S j n 2jn,j . 

… r t-J2 8 J n ^ 


(1.58) 
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By inspection one can verify that in the coefficient matrix (1.58) the elements 
in rows 2 to f + 1 add up to the elements in row 1, which shows that (1.55) is 
not of full rank; in fact, it is of rank t. The easiest way to solve these equations 
then is to impose the condition /x = 0. This means that we eliminate the first 
row and first column from (1.58) and the first element in (1.59) and solve the 
resulting system of t equations in the t unknowns t^i,. •., If we define 
S = diag(p》，then this system of equations resulting from (1.58) and (1.59) can 
be written as 

R — NK_ 乂 I — S _1 ) r = T - NK_ 乂 I — S _1 ) B (1.60) 

which we write for short as 

Ar = P (1.61) 

with A and P as described above. The solution then is 

r = A _1 P (1.62) 

or, in terms of a generalized inverse for the original set of NE (1.55) 

1 ] = [2 : _ 

with 

E (Ti) = Ti (i = 1, 2, . . . , f) 

If we denote the (/,〆）element of A~ 1 by a ll \ then 

var = var (?/ _ $/) = (a n + 〆〆 _ 2a 11 ) cr 》 (1.64) 

More generally, the treatment contrast c’r is estimated by with variance 


var (c r r) = c' A~ x co^ 


(1.65) 
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Expression (1.65) looks deceptively simple, but the reader should keep in mind 
that the elements of A~ l depend on cr^ and o^. We shall return to estimating 
(1.65) in Section 1.10. 

Finally, we note that the equations (1.60) show a striking similarity to the 
intrablock NE (1.7), except that the system (1.60) is of full rank and the elements 
of its coefficient matrix depend on the unknown parameters cr^ and cr^. 


1.8.4 Some Special Cases 

As a special case of the above derivations we mention explicitly the equireplicate, 
proper design, that is, the design with all r/ = r and all kj = k. We then define 


a e + ka j 


and write (1.60) as 



NN f 


T 



-p~ 1 \nb 


( 1 . 66 ) 


(1.67) 


( 1 . 68 ) 


We shall comment briefly on the set of equations (1.68) for two special cases: 

1. If p— 1 = 0， that is, cr^ = oo, then (1.68) reduces to the NE (1.7) for the 
intrablock analysis. This means, of course, that in this case no interblock 
information is available. This is entirely plausible and suggests further that 
for “large” g 、the interblock information is very weak and perhaps not worth¬ 
while considering. 

2. If p~ l = 1, that is, cr^ = 0, the solution to (1.68) is the same as that obtained 
for the completely randomized design (CRD) with the restriction /I = 0. This, 
of course, is a formal statement and should not imply that in this case the 
observations should be analyzed as if a CRD had been used. 


1.9 RELATIONSHIPS AMONG INTRABLOCK, INTERBLOCK, AND 
COMBINED ESTIMATION 


On several occasions we have pointed out that there exist certain relationships 
among the different types of analysis for incomplete block designs. It is worth¬ 
while to exposit this in a little detail. 
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1.9.1 General Case 
For the full model 


y = + + ^ + ^ 

with the double error structure we have 


五 (j) = 3/x + X t r 
and 

var(_y) = XfiX» lc^ 

= (/ + yXpX^ 



as in (1.53) with y = cr^/cr^. Then, as explained in Section 1.8, the estimators 
of estimable functions are obtained from the Aitken equations by minimizing 

(y-Ofi- X T r) f V~\y - 3/x - X T r) (1.69) 


with respect to /i and r. To simplify the algebra, let 0 = 3/x + X r r and then 
write ♦ = y — Q. Expression (1.69) can now be written simply as 一 ' 伞 . We 
then write 


jjr = Pj/t + (/ — P)i/r 
where, again for brevity, we write 

p = p x ^x fi (x , fi x fi r l x , fi 

= X p K~ l X , fi 


Then 

f'v~ x f = [f'P + f\l - P)]V~ l [Pf + (/ - P)f] 


(1-70) 


(1-71) 


In order to expand the right-hand side of (1.71) we make use of the following 
results. From 


(/- P)V = (/- P)(/ + yX^) = {i-P) 
[using (1.70)] it follows that 


(I - P)V~ l =(I -P) 
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and hence 

(I — P)V~ l P = (/ - P)P = 0 

Thus, (1.71) reduces to 

= f'PV~ l Pf + f\I - P)f (1.72) 

To handle the term x/r'PV~ l Pi/r we note that V~ l in (1.57) can be rewritten as 
V x =l - yX fi (I + yX^Xpr 1 ^ 

so that 

PV l P = P- yXfi(I + YX'pX^X^ 

=X P \k~ x -Yd + YKT^X'^ 

= Xf i (K + YK 2 r l X' ti 

Hence 

f'PV-ipf = (x'pir^ (k + yK 2 y l (x'^ (1.73) 

Then we note that X^yjr is the vector of block totals of the vector 少 . Since 
j — 0, we have 

var(X>) = var 

aj + X'pX^ 

=(K + VK 2 ) 

Hence PV~ l Pi/r is equal to 

E(B)]' (^K + yK 2 Y'[B- E(B)] (1.74) 

with B = X^y being the vector of block totals. 
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The second expression in (1.72) is actually the quadratic form that needs to be 
minimized to obtain the RNE for 0 (see Section 1.4.7.1)，and this is equivalent 
to minimizing 


Qi^(y~ X x r)\l - P)(y- X T z) (1.75) 

We thus summarize: To obtain the intrablock estimator, we minimize to 
obtain the interblock estimator, we minimize Q^\ and to obtain the combined 
estimator, we minimize Q\ + Q:. 


1.9.2 Case of Proper, Equireplicate Designs 

We now consider the case with R = rl and K = kl. The intrablock NE are [see 
(1.7)] 

(r/ _ 去顯 ，)t = Q 

For the interblock observational equations 

B = kOfi + N f T + error 

or, absorbing fi into r, that is, replacing 3/x + r by r, 

B = N f r + error 


we have the interblock NE [see (1.47)] 

NN f r* = NB 


As we have pointed out earlier (see Section 1.8) and as is obvious from (1.72), 
the combined NE are 


rl - -NN r 

K k j 


1 


k(l + yk) 


NN r 


e + 


i 


k(l + yk) 


NB 


(1.76) 


The form of (1.76) shows again two things: 


1. The intrablock and interblock NE are related. 

2. The matrix NN f determines the nature of the estimators, both intrablock and 
interblock. 
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The properties of N N’ can be exploited to make some statements about 
estimable functions of treatment effects. Being real symmetric, NN f is orthogo¬ 
nally diagonalizable. We know that 

(rl - 去而 ，) 3 = 0 


or 


NN f 3 = rk3 


so that one root of NN f is rk 三 8 t , say, with associated eigenvector 0 = 

与 t ， say. Suppose L ，矣 2 ,... ，与 complete the full set of orthonormal eigenvec¬ 
tors with associated eigenvalues 81 , 82 ,, ^-i* Then with 

o = (H...U 

and 


履 m 


the NE are equivalent to 


and 



OO r = O'Q 


ONNOOr* = ONE 


respectively. If we write 


vi 

V2 

Vt 


O't 


then Eq. (1.77) and (1.78) reduce to 


and 




'Vi = at 



=bi 


(1.77) 


(1-78) 


respectively, where 


a = (ai, «2,..., at)' = O’ Q 
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and 


b = (b\ ， b2 ,..., bt)' = O’NB 

We see then that we have both intrablock and interblock estimators of Vi if 5/ 
is not equal to 0 or to rk. For the component v t , only the interblock estimator 
exists. If other roots are equal to rk, then the intrablock estimators for the corre¬ 
sponding treatment parameters do not exist. Similarly, if other roots are equal to 
zero, then the interblock estimators for the corresponding treatment parameters 
do not exist. The treatment parameters vi ， V2 , …， ^t-1 are necessarily treatment 
contrasts. If the design is connected, then no 5 / (i = 1,2,..., t — 1) will equal rk. 
The combined NE (1.76) are now transformed to 

I ( r - — ) + --- 8i % = a t + ——-—— （/ = 1 ， 2, ... ，，） （ 1.79) 

LV k) k(l + yk) l j k(l^yk) 

We know that 

var(g)= (rl — 


and 


Hence 


and 


wsir(NB) = N var(5)^ 

=N ^{\ + yk)IN r 


var(a) = o] 


diag ( r - 1) 


var(ft) = k(\ + ]/ 々 )diag(<^) 


So we see from (1.79) that combined estimation of the parameter vector v consists 
of combining intrablock and interblock estimators of components of v, weighting 
inversely as their variances. 


1.10 ESTIMATION OF WEIGHTS FOR THE COMBINED ANALYSIS 


The estimator for the treatment effects as given by (1.61) depends on the weights 
w and w r - as can be seen from (1.58) and (1.59). If these weights were known, 
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or alternatively as is apparent from (1.68)，if the ratios of the interblock variance 
and the intrablock variance, 


W + k i a l 

Pj = ~7 = 2 

w j 〜 2 

were known, then the solution (1.63) to the NE (1.55) would lead to best linear 
unbiased estimators for estimable functions of treatment effects. Usually, how¬ 
ever, these parameters are not known and have to be estimated from the data. If 
the estimators are used instead of the unknown parameters, then the solutions to 
the normal equations (1.55) lose some of their good properties. It is for this rea¬ 
son that the properties of the combined estimator have to be examined critically, 
in particular with regard to their dependence on the type of estimator for the 
Pj\ and with regard to the question of how the combined estimator compares 
with the intrablock estimator. Before we discuss these questions in some more 
detail, we shall outline the “classical” procedure for estimating the pj. Since 
this method was proposed first by Yates (1940a) we shall refer to it as the Yates 
procedure or to the estimators as the Yates estimators. 


1.10.1 Yates Procedure 

One way to estimate w and w f j is to first estimate and and then use these 
estimators to estimate w and w f j. If the estimators are denoted by o^ 2 and 
respectively, then we estimate w and w f j as 


w : 




+ k i^j 


(j = 1，2’...， 幻 


Obviously, from Table 1.1 


a^ = MS(I\3,X p ,X r ) 


(1.80) 


(1.81) 


To estimate a 彦 we turn to Table 1.2. Under model (1.50) with covariance structure 
(1.51) we find [see also (1.39)] 


E[SS(X^\0,X T )] = (b-l)a^ <4 

r i 


Hence it follows from (1.81) and (1.82) that 




b- 1 




[ss(Ain r ) - ss(d 々， X T )] 


(1.82) 


(1.83) 
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The estimators (1.81) and (1.83) are then substituted into (1.80) to obtain w and 
Wj(j = 1,2,b). If in (1.83) < 0 for a given data set, we take 



1 

MS(I\0,X^X r ) 


In either case w and w f - are substituted into (1.58) and (1.59) and hence into 
the solution (1.61). Also, var (r/ — T") is estimated by substituting w, w f j and 
into (1.62) and (1.64). 

For alternative estimation procedures see Section 1.11, and for a numerical 
example see Section 1.14.3. 


1.10.2 Properties of Combined Estimators 

As we have already pointed out, the fact that the unknown parameters in (1.61) 
are replaced by their estimators will have an effect on the properties of the 
estimators for treatment effects. The two properties we are concerned about here 
are unbiasedness and minimum variance. 

Let c't be an estimable function of the treatment effects, let t = c'^i be its 
intrablock estimator, t(p) = c'^x its combined (Aitken) estimator with p known 
(for the present discussion we shall confine ourselves to proper designs), and 
t(p) = c'x the combined estimator when in (1.68) p is replaced by p = w/w f . 

Roy and Shah (1962) have shown that for the general incomplete block design, 
although the Yates procedure leads to a biased estimator for p, the estimators for 
treatment contrasts obtained by the method just described are unbiased, that is, 

E[t{p)] = C r T 

With regard to var[?(p)], it is clear that due to sampling fluctuations of we 
have 


var[r(p)] < var[?(p)] 

that is, the combined estimators no longer have minimum variance. The crucial 
question in this context, however, is: When is var[?(p)] < var(0? In other words: 
When is the combined estimator more efficient than the intrablock estimator? 

The answer to this question depends on several things such as (1) the true 
value of p, (2) the type of estimator for p, and (3) the number of treatments and 
blocks. It is therefore not surprising that so far no complete answer has been 
given. 

The most general result for the Yates estimator (and a somewhat larger class 
of estimators) is that of Shah (1964) based upon some general results by Roy 
and Shah (1962) [see also Bhattacharya (1998) for proper designs]. It is shown 
there that the combined estimator for any treatment contrast in any (proper) 
incomplete block design has variance smaller than that of the corresponding 
intrablock estimator if p does not exceed 2, or, equivalently, if < cr^/k. This 
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is a condition that, if blocking is effective at all, one would not in general expect 
to be satisfied. The problem therefore remains to find methods of construct¬ 
ing estimators for p such that the combined estimator for treatment contrasts is 
uniformly better than the corresponding intrablock estimator, in the sense of hav¬ 
ing smaller variances for all values of p. For certain incomplete block designs 
this goal has been achieved. We shall mention these results in the following 
chapters. 

The only general advice that we give at this point in conjunction with the use of 
the Yates estimator is the somewhat imprecise advice to use the intrablock rather 
than the combined estimator if the number of treatments is “small.” The reason 
for this is that in such a situation the degrees of freedom for MS(/|3, X r ) 

and X x ) are likely to be small also, which would imply that and 

cr^, and hence p, cannot be estimated very precisely. 


1.11 MAXIMUM-LIKELIHOOD TYPE ESTIMATION 


In this section we discuss alternatives to the Yates procedure (see Section 1.10) 
of estimating the variance components a, and for the combined analy¬ 
sis. These estimators are maximum-likelihood type estimators. This necessi¬ 
tates the assumption of normality, which is not in agreement with our under¬ 
lying philosophy of finite population randomization analysis. The reason for 
discussing them, however, is the fact that they can easily be implemented in 
existing software, in particular SAS PROC MIXED (SAS, 1999-2000) (see 
Section 1.14). 


1.11.1 Maximum-Likelihood Estimation 

It is convenient to rewrite model (1.49) with its covariance structure (1.51) in 
matrix notation as follows: 


y = + X T t + X pp + e 

=XocUP+ e (1.84) 

where Xa represents the fixed part, with X = (3 X r ), a! = (/ z, r f ), and Ue 
represents the random part. Thus 

E(y) = Xa 
and 

var(j) = UU f aj + I n a^ 


2 


(1.85) 
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with V = yUU f + I n and y = [see also (1.53) ， (1.54)]. We then assume 

that 

y ~ N n (Xa, (1.86) 

that is, y follows as a multivariate normal distribution (see 1.4.17.1). 

The logarithm of the likelihood function for y of (1.86) is then given by 

入 =.-\n log n - \na} - \ log| V| - 士 (j — X a yV~ l (y - Xa)/a^ (1.87) 

Hartley and Rao (1967) show that the maximum-likelihood (ML) estimator of a, 
y, and cr 》are obtained by solving the following equations: 

X,(X'V~ l y - X f V~ l Xa) = 0 (1.88) 

a e 

--tr(V~ l UU f ) + - Xa)'V- l UU'y-\By - Xa) = 0 

2 2a2 

- £2 + - ^«) = 0 

where tr(A) represents the trace of the matrix A. 

The basic feature of this method is that the fixed effects and the variance com¬ 
ponents associated with the random effects are estimated simultaneously in an 
iterative procedure. We shall not go into the details of the numerical implemen¬ 
tation (see, e.g., Hemmerle and Hartley, 1973)，but refer to the example given in 
Section 1.14.4 using SAS. 


1.11.2 Restricted Maximum-Likelihood Estimation 

Specifically for the estimation of weights for the purpose of recovery of interblock 
information, Patterson and Thompson (1971) introduced a modified maximum- 
likelihood procedure. The basic idea is to obtain estimators for the variance 
components that are free of the fixed effects in the sense that the likelihood does 
not contain the fixed effect. Operationally this is accomplished by dividing the 
likelihood function (1.87) into two parts, one being based on treatment contrasts 
and the other being based on error contrasts, that is, contrasts with expected value 
zero. Maximizing this second part will lead to estimates of functions of cr^ and 
o^. Because of the procedure employed, these estimates are by some referred 
to as residual maximum-likelihood estimates (REML), by others as restricted 
maximum likelihood estimates (REML). The latter name is derived from the 
fact that maximizing the part of the likelihood free of the fixed effects can be 
thought of as maximizing the likelihood over a restricted parameter set, an idea 
first proposed by Thompson (1962) for random effects models and generalized 
for the general linear mixed model by Corbeil and Searle (1976), based on the 



MAXIMUM-LIKELIHOOD TYPE ESTIMATION 


41 


work by Patterson and Thompson (1971). We shall give below a brief outline of 
the basic idea of REML estimation following Corbeil and Searle (1976). 

Consider model (1.84) and assume that the observations are ordered by treat¬ 
ments, where the ith treatment is replicated r/ times (/ = 1,2,, t). Then the 
matrix X can simply be written as 

~0 n 

0 r2 0 
z= • 

_0 o n 

To separate the log-likelihood function (1.87) into the two parts mentioned above, 
we employ the transformation (as proposed by Patterson and Thompson, 1971) 


=°n (1-89) 

i=l 


(1.90) 

where 

S= I - X(X'X)~ l X' 

t 

=E + 

i=l 

is symmetric and idempotent. Furthermore, SX = 0, and hence Sy is distributed 
A^(0, SVSa^) independently of X'V~ l y. 

It follows from (1.91) that S is singular. Hence, instead of S we shall use 
in (1.90) a matrix, T say, which is derived from S by deleting its rith, + 

r 2 )th, (ri + 厂 2 + r 3 )th, .. (r\ + r 2 H - h r t )th rows, thereby reducing an n x n 

matrix to m (n — t) x n matrix (with n — t representing the number of linearly 
independent error contrasts). More explicitly, we can write T as 

t r 1 -j 

T=T + i n -x '■ o r; _i - -u; 

L ri 」 

=E + 

i=l 

It follows from (1.89) and (1.92) that 


-1 - 5r,-l ^r；-l - - 


n 


(1.92) 


l n - —^n^r 

V r i 


(1.91) 


rx = o 


(1.93) 
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Considering now the transformation 


it follows from (1.86) and (1.93) that 


z ^ N 


.X'V^Xa, 


X'\~ x Xa] 


Clearly, the likelihood function of z consists of two parts, one for Ty, which 
is free of fixed effects, and one for X f V~ x y pertaining to the fixed effects. In 
particular, the log likelihood of Tj then is 

= ^(n — t)log 2 tt — — t)log 

-去 log \TVT f \- \y , T , {TVT , )- l y/a^ (1.95) 

The REML estimators for y = and 〜 2 are obtained by solving the 

equations 


The resulting equations have no analytic solutions and have to be solved itera¬ 
tively. We denote the solutions, that is, estimates by y and respectively. 

The fixed effects, represented by a in (1.84), can be estimated by considering 
the log likelihood of X f V~ l y, which is given by 

A，2 ―—— 士 /log 27 T _ 士 /■ log 

log 


(y - Xa)' V- l X(X'V- l X)- l X'V~ l (y - Xa)/a; (1.98) 


Solving 


leads to the estimator 


oc = (X'V- l X)- l X'V~ l y 


(1-99) 
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This, of course, assumes that V is known. Since it is not, we substitute y from 
(1.96) and (1.97) for y in (1.99) and obtain the estimate 

a = ( 1 . 100 ) 

where V denotes V with y replaced by y. 

An approximate estimate of the variance of a is given by 

w(S) = (1.101) 

For a numerical example using REML see Section 1.14.4. 


1.12 EFFICIENCY FACTOR OF AN INCOMPLETE BLOCK DESIGN 

We have seen in Sections 1.9.3 and 1.10.2.9, for example, how we can compare 
different error control designs with each other by using the notion of relative 
efficiency. In this case, we compare two error control designs after we have per¬ 
formed the experiment using a particular error control design. For example, after 
we have used an RCBD we might ask: How would we have done with a corre¬ 
sponding CRD? In other cases, however, we may want to compare error control 
designs before we begin an experiment. In particular, we may want to compare 
an incomplete block design (IBD) with either a CRD or an RCBD, or we may 
want to compare competing IBDs with each other. For this purpose we shall use 
a quantity that is referred to as the efficiency factor of the IBD. It compares, 
apart from the residual variance, cr^, the average variance of simple treatment 
comparisons for the two competing designs. 


1.12.1 Average Variance for Treatment Comparisons for an IBD 

Let us now consider 

av. var(r/ — r；/) (1.102) 

for a connected IBD. We know, of course, that (1.102) is a function of C _ , a 
generalized inverse of the C matrix. Suppose now that all the block sizes are 
equal to k. Then we have 


C = R- -NN f 
k 
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and we know that C has one zero root, d t = 0 say, with associated normalized 
eigenvector Let the other roots be 办， ... ， d t -\ with associated 

orthonormal eigenvectors 夸 Then 


and from 


it follows that 


^C = di^ = 

%\Cx = d^x 


卜 # Q 


and 


var(^-r) = 


di 




(1.103) 


Using the fact that = (l/^/t)0, that H … ， ^t-i are mutually perpendic¬ 
ular and perpendicular to and that 

K = 1 


we have with z' = (zi ， Z 2 , … ， Zt) 
t-i 


E ($;_z ) 2 = z’K) z 

i=l \i=l / 

=z\I - ^ t )z 


E 


(s 


Zi 


fe-z ) 2 

i=l 

It is also easy to verify that 

v ’ i=\ 


(1.104) 


(1.105) 
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Taking zi = % — Ti, substituting into (1.105) using (1.104) and then taking expec¬ 
tations and using (1.103), yields for (1.102) 

2 r_1 i 

av. var(r ； — 节 '）=V — a] (1.106) 

i+i' t - \ ^ di 


1.12.2 Definition of Efficiency Factor 

It is natural in attempting to evaluate the efficiency of an IBD to compare it 
with a CRD since this is always a possible competing design. For a CRD with 
ri replications for treatment /, the average variance of treatment differences is 

_ _ / 1 1 \ 9 2 2 

f^： r var(r/ - r//) = ^av. ^(crd) = \(crd) (1.107) 

where is the harmonic mean of the r/, that is, 

1 1 ^ 1 

rh t ^ r t 

i 

We shall digress here for a moment and show that the best CRD is the one 
with all ri = r, and that is the design with which we shall compare the IBD. For 
this and later derivations we need the “old” result that the harmonic mean of a 
set of positive numbers is not greater than the arithmetic mean. It seems useful 
to give an elementary proof of this. 

Let the set of numbers be [xi, i = 1,2,, m}. Consider the quadratic 

m / 

q(P) = ^ P 

Clearly q(fi) > 0 for all p. The minimizing value of ^ is obtained by using least 
squares which gives the NE 





m 


The minimum sum of squares is 


Xi — 卩 m 


Hence 


J2 Xi 


m 


> 0 
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or 



or 

— >1 

X h - 

with equality if and only if X[ = jc for all i. 

This result implies that the best CRD will have r[ = r and r = n/t where n is 
the total numbers of EUs. This can happen, of course, only if n/t is an integer. 
If n/t is not an integer so that n = pt q (0 < q < t), then the best CRD will 
have q treatments replicated p l times. 

Consider now the case of an IBD with b blocks of size k and r/ replications 

for the /th treatment. Then the total number of EUs is n = bk = ^2 Suppose 

also that n = rt, so that an equireplicate CRD is possible. The average variance 
for such a design is 2 〜 1 2 3 4 ( CRD )/r，whereas the average variance for the IBD is 
2 o^( ibd )/c where, as shown in (1.106), c is the harmonic mean of the positive 
eigenvalues of (^R — (l/k) NN ’、(see Kempthome, 1956). It is natural to write 
c = rE, so that with ^crd) = ^ibd) we have 

av. var(r ； — tJOcrd 2/r 

- / 丄 =TTT = E (1-108) 

av. var(r,- - t,/)i B d 2/r£ 

The quantity E thus defined is called the efficiency factor of the IBD. It is 
clearly a numerical property of the treatment-block configuration only and hence 
a characteristic of a given IBD. 

We add the following remarks: 

1. The same definition of E in (1.108) could have been obtained by using the 
average variance for an RCBD with b = r blocks instead of the average 
variance for an equireplicate CRD assuming that °^(rcbd) = a e{mD ) - 

2. Although £ is a useful quantity to compare designs, it does not, of course, 
give the full story. It compares average variances only under the assump¬ 
tion of equality of residual variances, whereas we typically expect 〜 2 (ibd) < 

°^ 2 (crd) an d ^(ibd) < °^ 2 (rcbd). 

3. The efficiency factor pertains only to the intrablock analysis and ignores the 
interblock information. 

4. Each IBD will have associated with it an efficiency factor E. In order to 
compare two competing IBDs with the same n and with efficiency factors 
E\ and E 2 , respectively, we would typically choose the one with the higher 
E value. 
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1.12.3 Upper Bound for the Efficiency Factor 

Using again the fact that the harmonic mean of positive numbers is not greater 
than the arithmetic mean, we have 


(t - l)c < d i = trace (/? — 

=ii r '-\n n l 

/=1 ij 

=« - 江 4 d-109) 

ij 

The largest value of the right-hand side of (1.109) is obtained for the smallest 
value of ^2- njj. Since n" is one of the numbers 0, 1, 2,..., the minimum 
value of njj will be achieved when n of the nifs are 1 and the remaining are 
zero. Since then nfj = nij and n ij = ^ follows from (1.109) that 

E l x —、 k _ 1 _ 

r i = ~j^ tr 



( 1 . 110 ) 


( 1 . 111 ) 


We shall see later (see Chapter 2, also Section 1.9.8.2) that the upper bound given 
in (1.110) will be achieved for the balanced incomplete block design. 

Sharper upper bounds for certain classes of IBDs are given by Jacroux (1984), 
Jarrett (1983), Paterson (1983), and Tjur (1990); see also John and Williams 
(1995). 
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1.13 OPTIMAL DESIGNS 

We have argued in the previous section that in order to compare two designs, d\ 
and say, we may consider their efficiency factors E\ and E 2 , respectively, and 
choose the design with the higher efficiency factor. In particular, if the efficiency 
factor of one of those designs achieves the upper bound for that class of designs, 
we would consider that design to be optimal in some sense. Such considerations 
have led to the development of the notion of optimal designs and to various 
criteria for optimality. We shall describe briefly some of these criteria. 

1.13.1 Information Function 

Initial contributions to the formal discussion of optimal designs were made by 
Wald (1943) and Ehrenfeld (1953). Extending their results, Kiefer (1958) pro¬ 
vided a systematic account of different optimality criteria. These can be discussed 
either in terms of maximizing a suitable function of the information matrix or 
minimizing a corresponding function of the dispersion matrix of a maximal set 
of orthonormal treatment contrast estimates. 

In the context of our discussion the information matrix is given by C in (1.9). 
Let P'x represent a set of ^ — 1 orthonormal contrasts of the treatment effects. 
Using intrablock information from a connected design d, the estimator for P'x is 
given by P r< t with r' from (1.18). The dispersion matrix for P f< c is then given by 

v d 0-2 = P'C- d Pa] = (P'C d P)- l cr^ (1.112) 

[see (1.20)], where C 】and hence Cj refer to the specific design d used. The 
information matrix for the design d is then defined as 

C* d = (P'C- d P)~ x (1.113) 

which shows, of course, the connection between CJ and Cd. 

An information function or optimality criterion is then a real-valued function 
0 that has the following properties (see Pukelsheim, 1993): 

1. Function 0 is a monotonic function; that is, an information matrix C* is at 
least as good as another information matrix D* if 0(C*) > 0(D*); 

2. Function 0 is a concave function, that is, 0[(1 — a) C* + aD*] = (1 — 
a)0(C*) + a 0(/)*) fora G (0, 1); 

3. Function 0 is positively homogeneous, that is, 0(5C*) = <5 0(C*). 

Condition (2) says that information cannot be increased by interpolation. And 
condition (3) says that even if we define the information matrix to be directly 
proportional to the number of observations, n, and inversely proportional to o^ 2 , 
that is, the information matrix is of the form (n/o^)C*，we need to consider 
only C*. 
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Let D be the set of competing designs. The problem of finding an optimal 
design J in D can then be reduced to finding a design that maximizes 0(CJ) 
over J in D (see Cheng, 1996). Such a design is called -optimal. 

As indicated above, an alternative, and historically original, approach to find¬ 
ing an optimal design is to consider minimization of some convex and non¬ 
increasing function O of dispersion matrices, as indicated by the relationship 
between (1.112) and (1.113). Accordingly, we shall then talk about a O-optimal 
design. 

1.13.2 Optimality Criteria 

Several optimality criteria, that is, several functions 0 or O have been considered 
for studying optimal designs. These criteria can be expressed conveniently in 
terms of the eigenvalues of or, equivalently, the nonzero eigenvalues of Cj, 
say Mji > MJ2 > > Ad ， t-i. 

The most commonly used optimality criteria are D-, A-, and 五 -optimality ， 
which maximize the following information functions: 

1. D-optimality: Determinant criterion or 

t-i 

= \\^di 
i=\ 

2. A-optimality: Average variance criterion or 



3. E-optimality: Smallest eigenvalue criterion or 

In terms of the corresponding O function, these optimality criteria can be 
expressed as minimizing 


t-i 

1. <D(V rf ) = det V d = Y\ ^~ d ! 


t~\ 

2. O(Vj) = tvV d = Y J ^~ d l 



(1.114) 


(1.115) 


3. = maximum eigenvalue of (1.116) 
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The statistical meaning of these criteria is that minimizing (1.114) minimizes 
the generalized variance of (1.115) minimizes the average variance of the 
set P’T，and (1.116) minimizes the maximum variance of a single normalized 
contrast. 

1.13.3 Optimal Symmetric Designs 

There exist special classes of designs for which all the nonzero eigenvalues of 
the information matrix Cj are equal. Such designs are called symmetric designs. 
Examples of symmetric designs are balanced incomplete block designs (Chapter 
2), Latin square designs (Section 1.10.2), Youden squares (Section 1.10.5), and so 
forth. The information matrix of a symmetric design is of the form al -\-b 33 ’， 
which is referred to as a completely symmetric matrix. 

In general, if a design is Oi optimal, it may not be O 2 optimal for two 
different optimality criteria Oi and O 2 . However, for symmetric designs Kiefer 
(1958) showed that they are A-, D-, and 五 -optimal. This led to the definition of 
universal optimality (Kiefer, 1975a) or Kiefer optimality (Pukelsheim, 1993). 

Definition 1.1 (Kiefer, 1975a) Let be the set of all t x t nonnegative 
definite matrices with zero row and column sums, and let O be a real-valued 
function on such that 

(a) O is convex, 

(b) 0(5C) is nonincreasing in the scalar 5 > 0, and 

(c) O is invariant under each simultaneous permutation of rows and columns. 

A design d* is called universally optimal in D if minimizes O(Cj) for every 
O satisfying conditions (a), (b), and (c). □ 

To help identify universally optimal designs we have the following fundamen¬ 
tal theorem. 

Theorem 1.2 (Kiefer, 1975a) Suppose a class C = {Cj, J g D} of matri¬ 
ces in 队 ， 0 contains a Cd* for which 

(a) Cd* is completely symmetric, and 

(b) tr C d * = maxj e d tr C d , 

Then d* is universally optimal in D. 

1.13.4 Optimality and Research 

We have just discussed the notion of design optimality and some of the avail¬ 
able optimality criteria. Other criteria have been introduced in other contexts, in 
particular, in the area of regression or response surface designs (see, e.g. ， Atkin¬ 
son and Donev ， 1992). And thus optimality has become a powerful concept, but 
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we need to remember that, although it has statistical meaning, it is a mathemat¬ 
ical concept. It has a definite and restricted connotation and it may be difficult 
to apply it in the larger context of designing a “very good” experiment for a 
researcher who has a scientific or technological problem. 

An immediate difficulty is that there is no simple classification of real prob¬ 
lems. There are discovery problems, for example, finding the point in factor 
space at which yield is maximum. There are exploration problems, for example, 
to obtain a “good” representation of the nature of the dependence of a noisy 
dependent variable on a given set of independent variables. There is the math¬ 
ematical problem in that same context that the dependence is known to be of 
a definite functional form with some specified but unknown parameters, which 
are to be determined from observations at some locations in the factor space. 
A common problem in technology and some scientific areas is what is called 
screening of factors. It is useful and important to think about this overall picture 
because there is a tendency to interpret the term “optimality of design” in a very 
limited context, a context that is very valuable, but misleading, in the sphere of 
total human investigation. 

The moral of the situation is multifold: (1) Researchers have to make a choice 
about problems and often work on unrealistic ones as the closest workable approx¬ 
imation to real live problems and should not be criticized for so doing; (2) almost 
any optimality problem is to some extent artificial and limited because criteria 
of value of designs must be introduced, and in almost any investigative situation 
it is difficult to map the possible designs valuewise into the real line; and (3) 
a solution to a mathematically formulated problem may have limited value, so 
to promote one design that is optimal only with respect to a particular criterion 
of value, Ci, and to declare another design to be of poor value because it is 
not optimal may be unfair because that design may be better with respect to a 
different criterion of value, C 2 say. And for one researcher C\ may be irrelevant, 
whereas C 2 may be more appropriate. 

Considerations of optimality involve, of course, comparison of designs. But 
how does one do this when error reduction needs to be taken into account? 
For example, how does one compare the randomized complete block and the 
Latin square design? Or how does one compare designs when different aspects 
of statistical inference are involved? This was at the basis of heated discussion 
between Neyman, who was interested in hypothesis testing, and Fisher and Yates, 
who were interested in precision of estimation (see Neyman, Iwaszkiewicz, and 
Kolodziejczyk, 1935). 

Informal optimality considerations early on gave probably rise to the heuristic 
(or perhaps mathematical) idea of symmetry and balancedness, and we shall 
encounter these characteristics throughout much of the book. Even though these 
properties do not always guarantee optimality in many cases they lead to near 
optimality. And from a practical point of view that may be good enough. On the 
other hand, if a balanced design is an optimal design, but we cannot use that 
design because of practical constraints and need to use instead a near-balanced 
design, then we have a way to evaluate the efficiency of the design we are going 
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to use. For example, we know that a balanced incomplete block design (Chapter 
2) is optimal. However, we cannot use the design and need to use a partially 
balanced incomplete block design (Chapter 4). We may then choose, if possible, 
a design with efficiency “close” to that of the optimal design. 

Thus, the insistence on an optimal design may be frustrating for the user 
because practical reasons may dictate otherwise and because an experimenter 
rarely has one criterion of value. Rather, he has many criteria of value and is 
in a mathematical programming situation in which he wishes the design to have 
“reasonable” efficiencies with respect to criteria Ci, C 2 , ..., Ck. The dilemma is 
often that the design that is optimal with respect to C\ is completely nonoptimal 
with respect to C 2 . 

In summary, mathematical ideals and requirements of empirical science do 
not always meet, but it is worth trying to find common ground. In the end, prac¬ 
tical considerations may dictate compromises in many instances of experimental 
research. 


1.14 COMPUTATIONAL PROCEDURES 

In this section we shall discuss some computational aspects of performing the 
intrablock analysis and the combined intra- and interblock analysis, mainly in the 
context of SAS procedures (SAS, 2000). For the intrablock analysis (see Section 
1.3) we shall use SAS PROC GLM, and for the combined analysis (see Section 
1.8) we shall use SAS PROC MIXED. 

1.14.1 Intrablock Analysis Using SAS PROC GLM 

Consider the following data set IBD (Table 1.7) with t = A treatments in b = 5 
blocks of size k = 2, such that treatments 1 and 4 are replicated 3 times, and 
treatments 2 and 3 are replicated 2 times. An example might be 5 pairs of 
identical twins representing the blocks, each twin being an experimental unit to 
whom different drugs are assigned according to the given plan. 

The SAS PROC GLM input statements for the intrablock analysis and the 
results are given in Table 1.8. We shall comment briefly on some aspects of the 
SAS output (see Table 1.8): 

1. The coefficient matrix as well as the right-hand side (RHS) of (1.5) are given 
under the heading “The X'X Matrix.” 

2. A generalized inverse for the coefficient matrix X'X is obtained by first 
eliminating the rows and columns for ^5 and T 4 from X'X as a consequence 
of imposing the conditions ^| = 0 and = 0 (we shall denote the SAS 
solutions to the NE by and r*). The reduced matrix is of full rank and 
thus can be inverted. The inverted matrix is restored to the original dimension 
by inserting zeros in the rows and columns corresponding to and . This 
matrix, together with df =MS(Error) from the ANOVA table, can be used to 
find the standard errors for the estimators of estimable functions for treatment 
effects. 
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Table 1.7 Data for Incomplete Block Design (t = 4, b = 5, k = 2, rl = 3, 
r2 = r3 = 2, r4 = 3) 

options pageno=l nodate； 
data IBD； 

input TRT BLOCK Y @@; 
datalines； 

1 1 10 2 1 12 
3 2 23 4 2 28 

1 3 13 3 3 27 

2 4 14 4 4 20 

1 5 15 4 5 32 

/ 

run； 

proc print data=IBD； 
titlel 'TABLE 1.7'; 

title2 'DATA FOR INCOMPLETE BLOCK DESIGN'; 
title3 ' (t=4 , b=5, k=2 , rl=3, r2=r3=2, r4=3) 1 ; 
run； 


Obs 

TRT 

BLOCK 

Y 

1 

1 

1 

10 

2 

2 

1 

12 

3 

3 

2 

23 

4 

4 

2 

28 

5 

1 

3 

13 

6 

3 

3 

27 

7 

2 

4 

14 

8 

4 

4 

20 

9 

1 

5 

15 

10 

4 

5 

32 


3. The general form of an estimable function for treatment effects is given by 


_L7ti + _L8*T2 + Z^9t3 — (Z>7 + Z>8 + _L9)T4 


for any values of L7, L8, and L9; that is, only contrasts are estimable. 

4. The general form of estimable functions can also be used to identify the 
solutions to the NE by putting sequentially (and in order) each Li = 1 and 
the remaining Lj = 0, (j # /). For example, the “Estimate” of “Intercept” 
is actually the estimate of /x + ^5 + T 4 ； that is, LI = l, Lj = 0, (j ^ 1). 
Expressed in terms of the SAS solutions we thus have 

/x* = fi-\- ) 65 + t 4 = 31.125 

Another example, putting LI = \, Lj = 0( j ^ 7), yields 

Tl * = rP ^ 4 =-15.25 
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Table 1.8 Intrablock Analysis with Post-hoc Comparisons 

proc glm data=IBD ； 

Class BLOCK TRT; 

model Y = BLOCK TRT/XPX inverse solution e ； 
lsmeans TRT/stderr e; 

estimate 'TRT1 - (TRT2+TRT3)/2' TRT 1 -.5 -.5 0; 

estimate 'TRT1 - TRT4' TRT 1 0 0 -1; 

estimate 'TRT2 - TRT3' TRT 0 1 -1 0; 

titlel 'TABLE 1.8'; 

title2 'INTRA-BLOCK ANALYSIS'; 

title3 'WITH POST-HOC COMPARISONS'; 

run ； 


The GLM Procedure 


Class Level Information 


Class Levels 


BLOCK 


Values 
1 2 3 4 5 


TRT 


12 3 4 


Number of observations 10 


The X'X Matrix 



Intercept 

BLOCK 1 

BLOCK 2 

BLOCK 3 

BLOCK 4 

BLOCK 5 

Intercept 

10 

2 

2 

2 

2 

2 

BLOCK 1 

2 

2 

0 

0 

0 

0 

BLOCK 2 

2 

0 

2 

0 

0 

0 

BLOCK 3 

2 

0 

0 

2 

0 

0 

BLOCK 4 

2 

0 

0 

0 

2 

0 

BLOCK 5 

2 

0 

0 

0 

0 

2 

TRT 1 

3 

1 

0 

1 

0 

1 

TRT 2 

2 

1 

0 

0 

1 

0 

TRT 3 

2 

0 

1 

1 

0 

0 

TRT 4 

3 

0 

1 

0 

1 

1 

Y 

194 

22 

51 

40 

34 

47 
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Table 1.8 (Continued) 





The X'X Matrix 




TRT 1 

TRT 2 

TRT 3 

TRT 4 

Intercept 

3 

2 

2 

3 

BLOCK 

1 

1 

1 

0 

0 

BLOCK 

2 

0 

0 

1 

1 

BLOCK 

3 

1 

0 

1 

0 

BLOCK 

4 

0 

1 

0 

1 

BLOCK 

5 

1 

0 

0 

1 

TRT 1 


3 

0 

0 

0 

TRT 2 


0 

2 

0 

0 

TRT 3 


0 

0 

2 

0 

TRT 4 


0 

0 

0 

3 

Y 


38 

26 

50 

80 



X r X Generalized 

Inverse (g2) 




Intercept 

BLOCK 1 BLOCK 2 BLOCK 3 

BLOCK 4 

Intercept 

0.75 

-0.375 -0 

.625 -0.375 

-0.625 

BLOCK 

1 

-0.375 

1.3125 0. 

4375 0.5625 

0.6875 

BLOCK 

2 

-0.625 

0.4375 1. 

3125 0.6875 

0.5625 

BLOCK 

3 

-0.375 

0.5625 0. 

6875 1.3125 

0.4375 

BLOCK 

4 

-0.625 

0.6875 0. 

5625 0.4375 

1.3125 

BLOCK 

5 

0 

0 

0 0 

0 

TRT 1 


-0.5 

-0.25 

0.25 -0.25 

0.25 

TRT 2 


-0.25 

-0.625 0 

.125 -0.125 

-0.375 

TRT 3 


-0.25 

-0.125 -0 

.375 -0.625 

0.125 

TRT 4 


0 

0 

0 0 

0 

Y 


31.125 

-7.6875 -4. 

0625 -1.9375 

-9.3125 



X' 

X Generalized 

Inverse (g2) 




TRT 1 

TRT 2 

TRT 3 

TRT 4 

Intercept 

-0.5 

-0.25 

-0.25 

0 

BLOCK 

1 

-0.25 

-0.625 

-0.125 

0 

BLOCK 

2 

0.25 

0.125 

-0.375 

0 

BLOCK 

3 

-0.25 

-0.125 

-0.625 

0 

BLOCK 

4 

0.25 

-0.375 

0.125 

0 

BLOCK 

5 

0 

0 

0 

0 

TRT 1 


1 

0.5 

0.5 

0 

TRT 2 


0.5 

1.25 

0.25 

0 

TRT 3 


0.5 

0.25 

1.25 

0 

TRT 4 


0 

0 

0 

0 

Y 


-15.25 

-9.625 

-3.125 

0 


Y 

194 

22 

51 

40 

34 

47 

38 

26 

50 

80 

4300 


BLOCK 5 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 


Y 

31.125 
-7.6875 
-4.0625 
-1.9375 
-9.3125 
0 

-15.25 
-9.625 
-3.125 
0 

18.1875 
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Table 1.8 


Dependent 

Source 

Model 

Error 

Corrected 


Source 

BLOCK 

TRT 

Source 

BLOCK 

TRT 


(Continued) 


General Form of Estimable Functions 


Effect 


Coefficients 

Intercept 


LI 

BLOCK 

1 

L2 

BLOCK 

2 

L3 

BLOCK 

3 

L4 

BLOCK 

4 

L5 

BLOCK 

5 

L1-L2-L3-L4 

TRT 

1 

L7 

TRT 

2 

L8 

TRT 

3 

L9 

TRT 

4 

L1-L7-L8-L9 


The GLM Procedure 

Variable : Y 


DF Sum of 

7 518 

2 18 

Total 9 536 


Squares 

Mean Square 

2125000 

74.0303571 

1875000 

9.0937500 

4000000 



F Value Pr > F 

8.14 0.1137 


R-Square Coeff Var 

0.966093 15.54425 

Root MSE 

3.015585 

19 

Y Mean 

.40000 


DF 

Type I SS 

Mean Square 

F 

Value 

Pr > F 

4 

261.4000000 

65.3500000 


7.19 

0.1259 

3 

256.8125000 

85.6041667 


9.41 

0.0975 

DF 

Type III SS 

Mean Square 

F 

Value 

Pr > F 

4 

79.1458333 

19.7864583 


2.18 

0.3388 

3 

256.8125000 

85.6041667 


9.41 

0.0975 
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Table 1.8 (Continued) 


Parameter 


Estimate 

Standard Error 

t Value 

Pr > |t| 

Intercept 


31.12500000 

B 

2.61157280 

11 

• 92 

0.0070 

BLOCK 

1 

-7.68750000 

B 

3.45478608 

-2 

.23 

0.1560 

BLOCK 

2 

-4.06250000 

B 

3.45478608 

-1 

.18 

0.3607 

BLOCK 

3 

-1.93750000 

B 

3.45478608 

-0 

.56 

0.6314 

BLOCK 

4 

-9.31250000 

B 

3.45478608 

-2 

.70 

0.1145 

BLOCK 

5 

0.00000000 

B 





TRT 

1 

-15.25000000 

B 

3.01558452 

-5 

.06 

0.0369 

TRT 

2 

-9.62500000 

B 

3.37152599 

-2 

• 85 

0.1039 

TRT 

3 

-3.12500000 

B 

3.37152599 

-0 

• 93 

0.4518 

TRT 

4 

0.00000000 

B 






NOTE ： The X ' X matrix has been found to be singular, and a gen¬ 
eralized inverse was used to solve the normal equations. 
Terms whose estimates are followed by the letter 'B 1 are not 
uniquely estimable. 

Least Squares Means 

Coefficients for TRT Least Square Means 
TRT Level 


Effect 




1 

2 


3 

4 

Intercept 



1 

1 


1 

1 

BLOCK 


1 

0 

.2 

0.2 

0 . 

2 

0.2 

BLOCK 


2 

0 

.2 

0.2 

0 . 

2 

0.2 

BLOCK 


3 

0 

.2 

0.2 

0 . 

2 

0.2 

BLOCK 


4 

0 

.2 

0.2 

0 . 

2 

0.2 

BLOCK 


5 

0 

.2 

0.2 

0 . 

2 

0.2 

TRT 


1 


1 

0 


0 

0 

TRT 


2 


0 

1 


0 

0 

TRT 


3 


0 

0 


1 

0 

TRT 


4 


0 

0 


0 

1 

TRT 

Y LSMEAN 



Standard 

Error 


Pr > |t| 

1 

11 

.2750000 



1.9774510 


0.0294 

2 

16 

.9000000 



2.6632921 


0.0239 

3 

23 

.4000000 



2.6632921 


0.0127 

4 

26 

. 5250000 



1.9774510 


0.0055 


Dependent Variable : Y 

Parameter Estimate Standard Error t Value Pr > 111 

TRT1 - (TRT2+TRT3)/2 -8.8750000 2.61157280 -3.40 0.0768 

TRT1 - TRT4 -15.2500000 3.01558452 -5.06 0.0369 

TRT2 - TRT3 -6.5000000 4.26468053 -1.52 0.2670 
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5. The top part of the ANOVA table provides the partition 

SS(MODEL) + SS(ERROR) = SS(TOTAL) 
and from that produces 

MS(ERROR) = a； = 9.094 

6. The lower part of the ANOVA table provides type I SS (sequential SS for 
ordered model; see 1.4.7.2) and type III SS (partial SS). From the latter we 
obtain the P value (.0975) for the test of 

Hq: zi = T2 = = r t 

versus 


Hi : not all zi are the same 

We stress that the P value for blocks should be ignored (see 1.9.2). 

7. The e-option of LSMEANS gives the coefficients for the solution vector to 
compute the treatment least squares means, for example, 

LSMEAN(TRT 1) =/x* + .2 ^ + rf 

= 31.125 + .2(-7.6875 - 4.0625 - 1.9375 

- 9.3125 + 0)- 15.25 

= 11.275 

The standard error is computed by making use of the G inverse (see item 2) 
and MS(ERROR). 

8. The t tests are performed for the prespecified contrasts among the least- 
squares means; for example, 

TRT2-TRT3 = r| - r 3 * = 9.625 + 3.125 = - 6.5 

se{xl - r 3 *) = [(1.25 + 1.25 - 2 x .25) x 9.094] 1/2 = 4.265 

- 6.5 

t = - = —1.52 


1.14.2 Intrablock Analysis Using the Absorb Option in SAS PROC GLM 

A computational method as described in Section 1.3 using the RNE can be imple¬ 
mented in SAS PROC GLM by using the ABSORB OPTION. This is illustrated 
in Table 1.9. 
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Table 1.9 Intra Block Analysis Using Reduced Normal Equations 
With Post-hoc Comparisons 

proc glm data=IBD; 

Class BLOCK TRT; 
absorb BLOCK ； 

model Y = TRT/XPX inverse solution e ； 


estimate 'TRT1 - (TRT2+TRT3)/2' TRT 1 -.5 -.5 0; 
estimate 'TRT1 - TRT4' TRT 1 0 0 -1; 
estimate 'TRT2 - TRT3' TRT 0 1 -1 0 ； 
titlel 'TABLE 1.9'; 

title2 'INTRA-BLOCK ANALYSIS USING REDUCED NORMAL EQUATIONS'; 

title3 'WITH POST-HOC COMPARISONS'; 

run ； 

The GLM Procedure 
Class Level Information 


Class Levels Values 

BLOCK 5 12345 


TRT 


4 12 3 4 


Number of observations 10 


The X'X Matrix 






TRT 1 

TRT 2 

TRT 3 

TRT 4 


Y 



TRT 

1 

1.5 

-0.5 

-0.5 

-0.5 


-16.5 



TRT 

2 

-0.5 

1 

0 

-0.5 


-2 



TRT 

3 

-0.5 

0 

1 

-0.5 


4.5 



TRT 

4 

-0.5 

-0.5 

-0.5 

1.5 


14 



Y 


-16.5 

-2 

4.5 

14 


275 






The GLM 

Procedure 








X'X 

Generalized Inverse 

(g2) 







TRT 1 

TRT 2 

TRT 3 

TRT 

4 

Y 

TRT 

1 



1 

0.5 

0.5 


0 

-15.25 

TRT 

2 



0.5 

1.25 

0.25 


0 

-9.625 

TRT 

3 



0.5 

0.25 

1.25 


0 

-3.125 

TRT 

4 



0 

0 

0 


0 

0 

Y 



- 

15.25 

-9.625 

-3.125 


0 

18.1875 


General Form of Estimable Functions 


Given that the coefficients for all absorbed effects are 


zero 
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Table 1.9 {Continued) 


Effect 


Coefficients 

TRT 

1 

LI 

TRT 

2 

L2 

TRT 

3 

L3 

TRT 

4 

-L1-L2-L3 


Dependent Variable : Y 

Sum of 


Source 


DF 

Squares 

Mean Square 

F Value 

Pr > F 

Model 


7 

518.2125000 

74.0303571 

8.14 

0.1137 

Error 


2 

18.1875000 

9.0937500 



Corrected Total 9 

536.4000000 






R-Square 

Coeff Var 

Root MSE 

Y Mean 




0.966093 

15.54425 

3.015585 19 

.40000 


Source 


DF 

Type I SS 

Mean Square 

F Value 

Pr > F 

BLOCK 


4 

261.4000000 

65.3500000 

7.19 

0.1259 

TRT 


3 

256.8125000 

85.6041667 

9.41 

0.0975 

Source 


DF 

Type III SS 

Mean Square 

F Value 

Pr > F 

TRT 


3 

256.8125000 

85.6041667 

9.41 

0.0975 





Standard 



Parameter 


Estimate 

Error 

t Value 

Pr > |t| 

TRT1 - 

(TRT2+TRT3) /2 

-8.8750000 

2.61157280 

-3.40 

0.0768 

TRT1 - 

TRT4 


-15.2500000 

3.01558452 

-5.06 

0.0369 

TRT2 - 

TRT 3 


-6.5000000 

4.26468053 

-1.52 

0.2670 





Standard 



Parameter 


Estimate 

Error 

t Value 

Pr > |t| 

TRT 


1 -15 

•25000000 B 

3.01558452 

-5.06 

0.0369 

TRT 


2 -9 

•62500000 B 

3.37152599 

-2.85 

0.1039 

TRT 


3 -3 

•12500000 B 

3.37152599 

-0.93 

0.4518 


TRT 4 0.00000000 B . 

NOTE: The X'X matrix has been found to be singular, and a gen¬ 
eralized inverse was used to solve the normal equations. 
Terms whose estimates are followed by the letter 'B' are not 
uniquely estimable. 
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We make the following comments about the SAS output: 

1. The X'X matrix is now the C-matrix of (1.9). 

2. The X'X generalized inverse is obtained by the SAS convention of setting 
T 4 = 0. This g inverse is therefore different from C 1 of (1.16). 

3. The ANOVA table provides the same information as in Table 1.8, except that 
it does not give solutions for the intercept and blocks. Hence, this analysis 
cannot be used to obtain treatment least-squares means and their standard 
error. 


1.14.3 Combined Intra- and Interblock Analysis Using the 
Yates Procedure 

Using SAS PROC VARCOMP illustrates the estimation of according to the 
method described in Section 1.10.1. The result is presented in Table 1.10. The 
option type I produces Table 1.2 with 


E[MS(BLOCK)] = E MS(Xp\0, X r ) 


as given in (1.82). This yields = 
Substituting 9，Q9 ^( )9 X 7，B 


9.09 (as in Table 1.8) and = 7.13. 
= 2.57 into (1.60) we obtain 


/ 2.0814 -0.3062 
-0.3062 1.3877 

-0.3062 0.0000 

\-0.3062 -0.3062 


-0.3062 -0.3062\ 
0.0000 -0.3062 
1.3877 -0.3062 
-0.3062 2.0814 / 


and 


/0.541654 0.146619 
0.146619 0.785320 
0.146619 0.064704 
\0.122823 0.146619 


0.146619 0.122823 、 
0.064704 0.146619 
0.785320 0.146619 
0.146619 0.541654 / 


and 


/ 4.6242\ 
8.8528 
22.1358 
\39.5816 / 



(uJJnsm-HJJSH 


{•XOJCJCB^laJA 

(yusa) SA 

JJsuodmou(uuuas-HJceA 


(JCSJCW) JCtTJA 
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(EHaEH)a+ (yucyIPQ)^f6>A999.0 + (JCOJC:CH)：CCOA 

a)JC(IJnbs u(TJa)sp(u4JuQ)dxw 


sJ-JOJm-HPSH T(udAIL 


OSASO ■ 6 

8 SS 00「6 T 


sssse. 9 寸 T 


aJJC(IJnbsr!c 6 a)w 


00000 寸 • 9 S 

os£T. 8T 

ess 寸 T . 6 A 

£9990 • 65 

ma)JCnJnbs 
jo mncn 


a) uut6-H:Ito >M-l o s -H s >1 rH to uv H(D&E-I 

X : stGUJCeA puspuadaa 


OH suo-H 4 Jt 6 AJI0sqo jo Jcsqulns 


S 寸 ro 3 T 


uo-HJJtTJmJOlHUIIHa)>a)T ssajxu 
(uJcnpaJuOJCd uo-H4-)cllm-H4-)sHs4-)u(uuodmouaJouns-HJCclIA 


IHttJJJoE-l^aJpoaJJCJCOU 


WHnawuOHtJ swivAHmisNISn-ro(DIH4J-H4J 

• 〜 *EHNWNOC3woulxlUNVIHV>^ucyIPQho NOIE-IVWIE-ISW - 3(DT4J-H4J 

--0H-H spqveh. T3I4J-H4J 
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We then obtain 


c _ 24.4379 

\26.5510/ 

We note that the elements of r are actually the treatment least-squares means. 

Their estimated variances and the estimated variances for the treatment contrasts 
〜 _ l 

are obtained from A x 9.09 (see Tables 1.13 and 1.14). 

1.14.4 Combined Intra- and Interblock Analysis Using SAS 
PROC MIXED 

We illustrate here the numerical implementation of the ML and REML proce¬ 
dures as described in Sections 1.11.1 and 1.11.2, respectively, using SAS PROC 
MIXED. The results of the ML estimation are given in Table 1.11. 

It takes four interations to obtain the solutions, yielding = 7.45 and df = 
4.14 and hence y = 1.80 (Notice that these are quite different from the estimates 
obtained by the Yates procedure (Section 1.14.3) and the REML procedure as 
given below). 

Since SAS uses a different parametrization than the one used in (1.84) it 
obtains “estimates” of /x and T/ (/ = 1, 2, 3, 4) separately. The type 3 coefficients 
indicate that the solutions for /x, r, (/ = 1, 2, 3,4) are actually estimates of /z + 
t4, T \ — t ： 4, T2 — r4, T3 — 7:4, respectively. From these solutions the least-squares 
means are then obtained as 


LSMEAN(TRT 1)= 

= 

=11.65 = 

= a\ 

LSMEAN(TRT 2)= 

=M +^2 

= 15.63 = 

= 0(2 

LSMEAN(TRT 3)= 

=M+^3 

= 24.09 二 

= 0(3 

LSMEAN(TRT 4)= 

=M 

= 26.53 : 

= S 4 


where the S/ denote the solutions to (1.87). 

The REML procedure is illustrated in Table 1.12. It takes three iterations to 
obtain the estimates = 6.35 and = 10.17, and hence y = 0.62. We note 
that the REML and ML least-squares means are numerically quite similar even 
though y and are substantially different from y and respectively. 

1.14.5 Comparison of Estimation Procedures 

For a small proper incomplete block design we have employed the above four 
methods of estimating treatment least-squares means and treatment comparisons: 


广 11. 9097 、 
14.8659 


Ml: Intrablock analysis 
M2: Combined analysis: Yates 
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Table 1.11 Combined Analysis Using Maximum Likelihood 
With Post-hoc Comparisons 


proc mixed data=IBD method=ML ； 


Class BLOCK TRT; 

model Y = TRT/ solution E3 ddfm=Satterth ； 


random BLOCK ； 
lsmeans TRT ； 

estimate 'TRT1 - (TRT2+TRT3)/2' TRT 1 -.5 -.5 0; 
estimate 'TRT1 - TRT4' TRT 1 0 0 -1; 
estimate 'TRT2 - TRT3' TRT 0 1 -1 0; 
titlel 'TABLE 1.11'; 

title2 'COMBINED ANALYSIS USING MAXIMUM LIKELIHOOD'; 

title3 'WITH POST-HOC COMPARISONS'; 

run ； 

The Mixed Procedure 


Model Information 


Data Set 

Dependent Variable 
Covariance Structure 
Estimation Method 
Residual Variance Method 
Fixed Effects SE Method 
Degrees of Freedom Method 


WORK.IBD 
Y 

Variance Components 
ML 

Profile 
Model-Based 
Satterthwaite 


Class Level Information 

Class Levels Values 


BLOCK 5 12345 

TRT 4 1234 


Dimensions 


Covariance Parameters 2 
Columns in X 5 
Columns in Z 5 
Subjects 1 
Max Obs Per Subject 10 
Observations Used 10 
Observations Not Used 0 
Total Observations 10 




COMPUTATIONAL PROCEDURES 


65 


Table 1.11 (Continued) 


Iteration History 


Iteration 

0 

1 

2 

3 

4 


Evaluations 

1 

2 

1 

1 

1 


-2 Log Like 

51.13433487 
50.29813032 
50.22541829 
50.22033455 
50.22029773 


Criterion 


0.00396161 
0.00030283 
0.00000230 
0 . 00000000 


Convergence criteria met. 

Covariance Parameter 
Estimates 


Cov Parm 

BLOCK 

Residual 


Estimate 

7.4528 
4.1426 



Fit Statistics 


-2 Log Likelihood 

50.2 

AIC 

(smaller is better) 

62.2 

AICC 

(smaller is better) 

90.2 

BIC 

(smaller is better) 

59.9 


Solution for Fixed Effects 


Effect 

TRT 

Estimate 

Standard 

Error 

DF 

t Value 

Pr > |t| 

Intercept 


26.5329 

1.7767 

8.87 

14.93 

< . 0001 

TRT 

1 

-14.8822 

1.9330 

4.75 

-7.70 

0.0007 

TRT 

2 

-10.9029 

2.1495 

4.82 

-5.07 

0.0043 

TRT 

3 

-2.4380 

2.1495 

4.82 

-1.13 

0.3099 

TRT 

4 

0 






Effect 

Type 3 Coefficients 

TRT Rowl 

for TRT 

Row2 

Row 3 



Intercept 

TRT 1 1 

TRT 2 1 

TRT 3 

TRT 


4 


1 
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Table 1.11 (Continued) 


Type 3 Tests of Fixed Effects 



Num 

Den 



Effect 

DF 

DF 

F Value 

Pr > F 

TRT 

3 

4.76 

23.37 

0.0028 


Estimates 


Standard 


Label 


Estimate 

Error DF 

t Value 

Pr > |t| 

TRT1 - 

(TRT2+TRT3)/2 -8.2117 

1.7085 4.3 

-4.81 

0.0072 

TRT1 - 

TRT4 

-14.8822 

1.9330 4.75 

-7.70 

0.0007 

TRT2 - 

TRT 3 

-8.4650 

2.6087 5.86 

-3.24 

0.0182 



Least 

Squares Means 





Standard 



Effect 

TRT 

Estimate 

Error DF 

t Value 

Pr > |t| 

TRT 

1 

11.6506 

1.7767 8.87 

6.56 

0.0001 

TRT 

2 

15.6299 

2.0786 9.98 

7.52 

<.0001 

TRT 

3 

24.0949 

2.0786 9.98 

11.59 

<.0001 

TRT 

4 

26.5329 

1.7767 8.87 

14.93 

<.0001 


M3: Combined analysis: ML 
M4: Combined analysis: REML 

In Tables 1.13 and 1.14 we present the estimates and their standard errors (exact 
or approximate) for these methods for purely numerical comparisons. 

Based on the numerical results, we make the following observations, which 
should not necessarily be generalized: 

1. For the least-squares means, Ml produces slightly smaller standard errors 
than M2, but the result is reversed for the contrast estimates. 

2. The results for M2 and M4 are very similar, both with respect to estimates 
and standard errors. 

3. In both tables M3 produces the smallest standard errors. 

1.14.6 Testing of Hypotheses 

To test the hypothesis 


Ho-T\ = T 2 = • • • = r t 
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Table 1.12 Combined Analysis Using Residual Maximum Likelihood With 
Post-hoc Comparisons 


proc mixed data=IBD ； 


Class BLOCK TRT; 

model Y = TRT/ solution E3 ddfm=Satterth ； 
random BLOCK ； 
lsmeans TRT ； 

estimate 'TRT1 - (TRT2+TRT3)/2' TRT 1 -.5 -.5 0; 
estimate 'TRT1 - TRT4' TRT 1 0 0 -1; 
estimate 'TRT2 - TRT3' TRT 0 1 -1 0 ； 
titlel 'TABLE 1.12'; 

title2 'COMBINED ANALYSIS USING RESIDUAL MAXIMUM LIKELIHOOD 1 ; 

title3 'WITH POST-HOC COMPARISONS'; 

run ； 

The Mixed Procedure 


Model Information 


Data Set 

Dependent Variable 
Covariance Structure 
Estimation Method 
Residual Variance Method 
Fixed Effects SE Method 
Degrees of Freedom Method 


WORK.IBD 
Y 

Variance Components 

REML 

Profile 

Model-Based 

Satterthwaite 


Class Level Information 

Class Levels Values 


BLOCK 5 12345 

TRT 4 1234 


Dimensions 


Covariance Parameters 2 
Columns in X 5 
Columns in Z 5 
Subjects 1 
Max Obs Per Subject 10 
Observations Used 10 
Observations Not Used 0 
Total Observations 10 
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Table 1.12 (Continued) 





Iteration History 




Iteration 

Evaluations -2 

Res Log 

Like 

Criterion 



0 


1 

37.32907360 




1 


2 

37.14557447 

0.00022052 



2 


1 

37.14253025 

0.00000186 



3 


1 

37.14250585 

0.00000000 




Convergence criteria 

met. 






Covariance Parameter 






Estimates 







Cov Parm 

Estimate 






BLOCK 


6.3546 






Residual 


10.1681 






Fit 

Statistics 






- 

2 Res Log Likelihood 

37 . 

1 




AIC (smaller 

is better) 

41. 

1 




AICC (smaller is better) 

45 . 

1 




BIC (smaller 

is better) 

40 . 

4 





Solution 

for Fixed Effects 







Standard 




Effect 

TRT 


Estimate 

Error 

DF 

t Value 

Pr > !t| 

Intercept 



26.5596 

2.2615 

5.93 

11.74 

<.0001 

TRT 

1 


-14.5682 

2.8843 

2.68 

-5.05 

0.0196 

TRT 

2 


-11.9152 

3.2045 

2.66 

-3.72 

0.0415 

TRT 

3 


-2.0305 

3.2045 

2.66 

-0.63 

0.5766 

TRT 

4 


0 








Type 3 Coefficients for TRT 




Effect 

TRT 

Rowl 

Row2 

Row3 



Intercept 






TRT 


1 

1 





TRT 


2 


1 




TRT 


3 



1 



TRT 


4 

-1 

-1 

-1 
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Table 1.12 (Continued) 


Type 3 Tests of Fixed Effects 



Num 

Den 



Effect 

DF 

DF 

F Value 

Pr > F 

TRT 

3 

2.42 

10.82 

0.0615 


Estimates 


Label 


Estimate 

Standard 

Error 

DF 

t Value 

Pr > |t| 

TRT1 - 

(TRT2+TRT3)/2 

-7.5953 

2.5979 

2.2 

-2.92 

0.0891 

TRT1 - 

TRT4 

-14.5682 

2.8843 

2.68 

-5.05 

0.0196 

TRT2 - 

TRT3 

-9.8847 

3.7522 

3.82 

-2.63 

0.0607 


Least Squares Means 


Effect 

TRT 

Estimate 

Standard 

Error 

DF 

t Value 

Pr > |t| 

TRT 

1 

11.9914 

2.2615 

5.93 

5.30 

0.0019 

TRT 

2 

14.6444 

2.7365 

5.52 

5.35 

0.0023 

TRT 

3 

24.5291 

2.7365 

5.52 

8.96 

0.0002 

TRT 

4 

26.5596 

2.2615 

5.93 

11.74 

<.0001 


versus 


we consider a set of ^ — 


H\ \ not all Xi are equal 

1 linearly independent contrasts, say Cr, and test equiv¬ 


alently 


H 0 ： Cr =0 


Table 1.13 Comparison of Least-Squares Means 


TRT 

Ml 


M2 


M3 


M4 

i 

LSM(TRT,) 

SE 

LSM(TRT t ) 

SE 

LSM(TRT/) 

SE 

LSM(TRT/) SE 

1 

11.28 

1.98 

11.91 

2.22 

11.65 

1.78 

11.99 2.26 

2 

16.90 

2.66 

14.87 

2.67 

15.63 

2.08 

14.64 2.74 

3 

23.40 

2.66 

24.44 

2.67 

24.09 

2.08 

24.53 2.74 

4 

26.53 

1.98 

26.55 

2.22 

26.53 

1.78 

26.56 2.26 
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Table 1.14 Comparison of Contrast Estimates 



Ml 

M2 

M3 

M4 


CONTRAST^ 

C 

SE 

C 

SE 

c 

SE 

C 

SE 

Cl 

-8.88 

2.61 

-7.75 

2.47 

-8.21 

1.71 

-7.59 

2.60 

C2 

-15.25 

3.02 

-14.64 

2.76 

-14.88 

1.93 

-14.57 

2.88 

C3 

-6.50 

4.26 

-9.57 

3.62 

-8.47 

2.61 

-9.88 

3.75 


a Cl = TRT1 - (TRT2 + TRT3)/2 
C2 = TRT1 - TRT4 
C3 = TRT2 - TRT3 


Table 1.15 Comparison of Testing Ho ： r\ = T 2 = T 3 = T 4 


Method 

F Ratio 

Denominator 

d.f. 

P Value 

Source 

Ml 

9.41 

2 

.0975 

Table 1.8 

M2 

11.73 

2 

.0796 

See below 

M3 

23.37 

4.76 

.0028 

Table 1.11 

M4 

10.82 

2.42 

.0615 

Table 1.12 


versus 


Hi ： Cr ^0 

We then compute the test statistic 

F = (Cr)'[CA~ l Cr l Cr/[(t - 1) MS(E)] 

which follows approximately an F distribution with t — l and n — t — b l d.f. 
For the data set in Table 1.7, using 

/I -1 0 0\ 

C= 1 0-1 0 

\1 0 0 - 1 / 

with the Yates procedure we obtain F = 11.73 with 3 and 2 d.f. 

A comparison of the four methods of analysis [as described in (Section 1.13.5)] 
concerning the test of treatment effects is given in Table 1.15 

It is interesting to note that the results for Ml, M2, and M4 are in close 
agreement, whereas the result for M3 is quite different. This appears due to the 
fact that the estimate of for M3, namely 4.1426, is quite different from the 
corresponding estimates using M2, namely, 9.09375, and M4, namely 10.1681. 






CHAPTER 2 


Balanced Incomplete Block Designs 


2.1 INTRODUCTION 

In the previous chapter we were concerned with incomplete block designs in 
very general terms, mainly with the analyses of these designs. But as men¬ 
tioned earlier, there exist, among the totality of all incomplete block designs, 
several special types that have evolved in the process of refining the art of 
experimental design, these types being characterized by some common feature 
or property. 

One such type of design is the balanced incomplete block (BIB) design intro¬ 
duced by Yates (1936a) (see also Section 1.9.8.2). This is a proper, equireplicate, 
binary design such that each elementary treatment contrast, that is, the difference 
between two treatment effects, is estimated with the same variance. 


2.2 DEFINITION OF THE BIB DESIGN 


In this section we shall give a more precise and more formal definition of the 
BIB design. First we note that for a binary design the incidence matrix N = (nij) 
has elements 



if treatment i occurs in block j 
otherwise 


Furthermore, for an equireplicate design 


b 

riij = r for all i 


( 2 . 1 ) 


Design and Analysis of Experiments. Volume 2: Advanced Experimental Design 

By Klaus Hinkelmann and Oscar Kempthorne 

ISBN 0-471-55177-5 Copyright © 2005 John Wiley & Sons, Inc. 


71 



72 


BALANCED INCOMPLETE BLOCK DESIGNS 


and for a proper design 

t 

= k for all j (2.2) 

i=l 

where r is the number of replications for each treatment and k is the block size. 

We denote the (/， /’）element of NN f by 入 that is, mjrii，j = 入 “•，， where 
入 denotes simply the number of blocks in which treatments i and i' appear 
together (it is for this reason that NN f is referred to as the concordance or 
concurrence matrix ; see Pearce, 1963). Then, with (2.1) and (2.2) and the fact that 
all differences between treatment effects are estimated with the same variance, 
we must have ku，= k for all i, i\i ^ /’)， as was shown by Thompson (1956) 
and Rao (1958). We thus have the following definition. 

Definition 2.1 An incomplete block design is said to be a balanced incom¬ 
plete block (BIB) design if it satisfies the following conditions: 

i. The experimental material is divided into b blocks of k units each, different 
treatments being applied to the units in the same block. 

ii. There are t treatments each of which occurs in r blocks. 

iii. Any two treatments occur together in exactly 入 blocks. 

The quantities t,b ， r ， k, and 入 are called the parameters of the BIB design. 
We note here that for a given set of parameters there may or may not exist a 
BIB design. □ 

2.3 PROPERTIES OF BIB DESIGNS 

The following relations hold among the parameters, and even these are only 
necessary conditions for the existence of a BIB design: 


rt = kb (2.3) 

入 0 - 1 ) =r{k- 1) (2.4) 

r > X (2.5) 

b > t (2.6) 


Relationship (2.3) follows immediately from the fact that the total number of 
observation for the BIB design is 

n = E (!>")=!] ([叫 ） 

and to this we apply (2.1) and (2.2). 
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To establish (2.4) we consider a fixed treatment that is replicated r times and 
in each of these r blocks there are 众 一 1 other treatments. On the other hand, 
because of (iii) in Definition 2.1, each of the remaining t — l treatments must 
occur in these blocks exactly 入 times. This leads immediately to (2.4). 

Condition (2.5) follows from (2.4) and the fact that for an incomplete block 
design k < t. 

Fisher (1940) first proved (2.6) to be a necessary condition for the existence of 
a balanced incomplete block design. The following proof is based on a property 
of the matrix NN\ From the definition of the BIB design it follows that 


Then 


-r \ AT 

入 r • 


NN，= 


=( 厂一入) / + 入 33’ 


L 入 入 r 」 


NN r 3 = (r - A)3 + tXO 
=(r — A, + Xt)0 


(2.7) 


So (1/V?)3 is a normalized eigenvector with root r — 入 + 入 ，. If § is another 
eigenvector of NN\ which is orthogonal to then we have from 

(2.7) that 


NN 飞 = (r- 


and hence (r — X) is an eigenvalue with multiplicity t — l. It follows then that 


deiNN' = (r - A.) f_1 (r - 入 + 入 0 


or, using (2.4), 


detiV^V' = (r-XY^rk (2.8) 

Because of (2.5), this determinant is positive definite of rank t. Now t = rank 
NN’ = rank N < min(?, b). Hence b > t. 

In connection with the above necessary conditions for the existence of a BIB 
design, we shall mention briefly the special case when t = b and hence r = k. 

Definition 2.2 An incomplete block design (IBD) with the same number of 
treatments and blocks is called a symmetrical incomplete block design. 
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For a symmetrical BIB design, (2.8) will be of the form 
detNN，= (detAO 2 = (r 一 入 / - V 2 

This implies, for example, that if t is even then r — 入 must be a perfect square 
for a symmetrical BIB design to exist. This is only one of several necessary 
conditions (Shrikhande, 1950) for the existence of a symmetrical BIB design. 

Nonexistence theorems and methods of actually constructing BIB designs for 
a given set of parameters have become an important subject in itself, and we 
shall treat some aspects of this in Chapter 3. 

Before we consider the analysis of BIB designs, we give as an example the 
design mentioned in Section 1.1. □ 

Example 2.1 We have t = l，b = l，r = 4，k = 4，k = 2 and denoting the 
treatments now by 0, 1,, 6 the design is listed as 

(0, 3, 6, 5) 

(2, 5, 4, 6) 

(6, 0, 1, 4) 

(0, 1, 2, 5) 

(1, 6, 2, 3) 

(4, 2, 3, 0) 

(1, 3, 4, 5) 

where each row represents a block, the treatments in each block being randomly 
assigned to the experimental units. □ 

2.4 ANALYSIS OF BIB DESIGNS 
2.4.1 Intrablock Analysis 

Using the results of Section 1.3.2 the RNE (1.7) take on the form 


1 2 *f. 

G 2 :• G 


rrllr 2 .:L^ 


A A 


T^ 


入一灸 

I 






入一众 


A -T^ 

I 
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or 


- 1 入、 入…, 

~ + k) I ~k 33 


r= Q 


(2.9) 


A solution to (2.9) is obtained by using Theorem 1.1 with a = Then 

-1 1 , 


C 


Using (2.4) in (2.10) we can write 


k- 1 入 

r nr + k 


r k -^l+ X T = r T ( k -l k — V 


and hence 


k k k 


C 


t- 1 


1 


(k - \)t 
k(t- 1) 


( 2 . 10 ) 


(k — l)t 
k(t - 1) 


( 2 . 11 ) 


Recall that C is the variance - covariance matrix for estimable functions of 
treatment effects. Thus, from (2.11) we have 


var (ji - 节 /) 


2 


,2 


(k- l)t 
k(t - 1) 


for all i ♦ i’. Therefore also 


av. 

i 手 i ， 


var(rj - 务 ） = 


2 

(k - \)t 
r k(t- 1) 



It follows then from (1.89) that (k — l)t/k(t — 1) is the efficiency factor E of the 
BIB design and according to (1.91) the upper bound for any IBD. We then write 



尸 = 丄 / 

rE 

(2.12) 

Hence 

^ Q 

X = —— 

rE 

(2.13) 
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is a solution to (2.9). Furthermore, it follows from (2.13) and (1.20) that any 
linear estimable function c f r = C[ r/ with [c/ = 0 is estimated by 


E 


Ci 


Qi_ 

~rE 


(2.14) 


with variance, using ( 2 . 12 ), 


E< 


rE 


(2.15) 


The intrablock error variance is estimated in the usual way by using the 
analysis of variance given in Table 1.2. 


2.4.2 Combined Analysis 

It follows from the definition of the BIB design and the form of NN r as given 
in (2.7) that the coefficient matrix, A say, in (1.60) is now of the form 


A 


r-d-p- 1 ) 


V f 
k, 






(2.16) 


By a familiar result in matrix algebra (e.g. ， Graybill, 1969) we find 


A 


1 


r-(l- p- 1 ) 


r 入、 

,k k, 


i 入 

(1 - p -、 


r -(l- p~ l ) 


I 八 


30' 


(2.17) 


Denoting the (i, i f ) element of A -1 by a 11 , it follows from (1.62) and (1.63) that 
the linear function of treatment effects, c’r ， with c i = 0 is estimated by c f1 r 
with variance 


var 


^ Ci% I = ^cfa ll cr^ + ^ CiC^a 11 'c 
\ i / i 

i^i r 

Substituting the elements of (2.17) into (2.18) yields 


(2.18) 


var|^]c^] 


r - (l- p- 1 ) 


r V 


(2.19) 
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2.5 ESTIMATION OF p 

As we already know from the discussion in the previous chapter and as is obvious 
from (2.17), the combined estimator c ri t for c'x depends on w; = l/<7^ 2 and w r = 

1/ (o"f + kcr^j through p = w/w r . Since, in general, these parameters are not 
known, they will have to be estimated from the data; the estimates will have to 
be substituted into (2.17) in order to obtain the estimator for c'x, which we shall 
denote by c'x. Note that in our notation c{x depends on p and c'x depends on 
p, where p is an estimator for p. 

In the previous chapter we described one method of estimating p, the Yates 
procedure, which estimates crj and from the intrablock analyses of variance 
with the model assuming the block effects to be random variables. This is a 
general procedure and as such applicable also to BIB designs. However, we also 
mentioned the problem that arises when using the Yates estimators, since one is 
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not assured that the combined estimator for c’t is better than the corresponding 
intrablock estimator. 

The following theorem, which is a special case of a more general theorem 
given by Shah (1964), deals with this problem and gives a solution under a 
nonrestrictive condition. 

Let 

1 " 

SS(m 2r'?+ - SS(Z t |3, Xp) (2.21) 
E _ 

where SS(X r |3) and SS(Z r |3, ) are given in Tables 1.2 and 1.1, respectively, 

T f = (7], 了 2, … ， T t ) is the vector of the treatment totals and is given by (2.13). 


ry _ t(t — l)k(k — 1) 


Theorem 2.1 For t > 6 and 



t 一 k 

t{k- 1) |_0 _ l)MS E 


Z k(t - 1) 
MS e > t-k 


otherwise 


( 2 . 22 ) 


with Z defined by (2.21) and MS ^： = MS(/|3, X T ) being the residual mean 
square as given in Table 1.1, the combined estimator, c’r, for c't is uniformly 
better than the intrablock estimator, c f< c, that is, 


varC^r) < var(c / T) 


For the proof we refer to Shah (1964). 

Similar estimators for p have been developed by Seshadri (1963) and still 
others by Graybill and Weeks (1959) and Graybill and Deal (1959). 

We shall comment briefly on the nature of these estimators without going 
into any detail. Of particular interest is, of course, their relationship to the Yates 
estimator and the comparison of the performance of all the estimators with regard 
to estimating differences of treatment effects. 

We note that the Yates estimator utilizes all b — 1 d.f. between blocks as 
does the estimator proposed by Graybill and Weeks (1959), which, however, was 
shown by Seshadri (1963) to be not as good as the Yates estimator. All the other 
estimators utilize only part of the ^ — 1 d.f., either t — l or b — t. Under certain 
conditions these estimators have been shown to give rise to estimators for treat¬ 
ment differences uniformly better than the corresponding intrablock estimators. 
Such a property has not yet been proved for the Yates procedure, but the fact that 
it uses more information than the other estimators may lead one to suspect that 
it might actually be better than the estimators mentioned above that enjoy this 
property. To some extent this aspect was investigated by Shah (1970) along the 
following lines. 
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Let V be the average variance for treatment comparisons. Then V can be 
expressed as 

—— 

厂本 

where r* is defined as the effective number of replications. For example, r* cor¬ 
responding to the intrablock estimator is given by rj = rE. The largest possible 
value for r* is that corresponding to the combined estimator when p is known. 
Denote this value by r*. Since usually p is not known and hence has to be esti¬ 
mated from the data, some loss of information is incurred; that is, r*(P) < 
where r*(jo) is the effective number of replications when an estimator p is used 
instead of p in the combined analysis. Shah (1970) shows that for his estimator 
(given in Theorem 2.1) the difference between r* and r*(p) is not appreciable 
for moderate values of p. This implies that the loss of information is not great 
and not much improvement, if any, can be expected from any other estimation 
procedure, including the Yates procedure. On the other hand we know that for 
Shah’s estimator r*(p) > when t > 6 (see Theorem 2.1), which is not guar¬ 
anteed for the Yates procedure. For large values of p the difference between r* 
and r* is fairly small, so that not much is gained by using a combined estimator. 

The conclusion from this discussion then is that the Shah procedure leads in 
general to quite satisfactory results and should therefore be used in the combined 
analysis. 


2.6 SIGNIFICANCE TESTS 

As mentioned in Section 1.13.4 significance tests concerning the treatment effects 
are performed as approximate F test by substituting p for p in the coefficient 
matrix A of (1.60), using any of the previously described methods of estimating 
p. Such tests are mostly conveniently performed by choosing any option in SAS 
PROC MIXED. 

An exact test, based, however, on the assumption of normality, for 
Ho., ti = T 2 = • • • = 

against 

H\ \ not all X[ equal 

was developed by Cohen and Sackrowitz (1989). We shall give a brief descrip¬ 
tion here, but the reader should refer to the original article for details. The test 
is based on invariance properties and utilizes information from the intrablock 
analysis (as described in Sections 2.4.1 and 1.3) and the interblock analysis (as 
described for the general case in Section 1.7.2) by combining the P values from 
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the two respective independent tests. In order to define the test we establish the 
following notation: 


T = solution to the intrablock normal equations, (2.13) 
r* = solution to the interblock normal equations, (1.48 )， 
with L = I 


O = matrix of t — l orthonormal contrasts for treatment effects 


Ui = Or 
U 2 = Or* 

U l t = U\U l 

u 2 \\ 2 = u , 2 u 2 

Vi = t/i/ II[/ill 


V 2 = u 2 / II u 2 \\ 

R = V\V 2 

S 2 = SS(Error from intrablock analysis) 

2 

= SS (Error from interblock analysis) 

= E (Bj — kp — E n ij x i \ 

;=1 \ i=l / 

Tj = 5 2 + ^-||t/i|| 2 

k 

T 2 = s* +^—\\u 2 \\ 2 

k 

a = min(T\/T 2 , 1 ) 

y = l/(a + 1) 

P = P value for testing Hq using intrablock analysis 
P* = P value for testing Hq using interblock analysis, based on 


F statistic 


F* 


(b - t)(r - X) \\U 2 r 


k{t - 1) 
Zi = —i n P 
z 2 = -4P* 


s * 2 


with t — l and b — t d.f. 


Z = Y z \ + (1 — Y) Z 2 
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Then, for y ^ the P value for the exact test is given by 

P** = \ye~ z/Y - (1 - y)e~ z/{X - y A / [(2y - 1)(1 + /?)] 

and for y = ^ by [(2z + l)e~ 2z ] /(I + R). Cohen and Sackrowitz (1989) per¬ 
formed some power simulations and showed that the exact test is more powerful 
than the usual (approximate) F test. For a more general discussion see also 
Mathew, Sinha, and Zhou (1993) and Zhou and Mathew (1993). 

We shall illustrate the test by using an example from Lentner and Bishop 
(1993). ^ i 

Example 2.2 An experiment is concerned with studying the effect of six 
diets on weight gain of rabbits using 10 litters of three rabbits each. The data are 
given in Table 2.1 A. The parameters of the BIB design are ? = 6, Z? = 10, k = 
3, r = 5, A. = 2. 

The intrablock analysis using SAS PROC GLM is presented in Table 2.1 A 
yielding F = 3.16 and P = 0.0382. In addition, Table 2.1 A gives the SAS solu¬ 
tions to the normal equations, which are then used to compute the set of five 
orthonormal contrasts based on 


/ -5/V70 

-3/V70 

-i/Vto 

i/Vto 

3/VlO 

5/V70 \ 

5/V84 

-1/V84 

-4/V84 

-4/V84 

-1/V84 

5/V84 

-5/VI80 

7/^180 

4/VI80 

-4/^180 

-i/Vm 

5/v^l80 

1/V28 

-3/V28 

2/V28 

2/V28 

-3/V28 

1/V28 


V-1/V252 5/V252 -10/V252 10/V252 -5/V252 1/^252 ) 


The linear, quadratic, cubic, quartic, and quintic parameter estimates represent 
the elements of t/i, yielding || U\\\ 2 = 39.6817. 

A solution to the interblock normal equations [using L = / in (1.48)] is given 
in Table 2.IB. 

The estimates r* are then used to obtain U 2 and || U 2 || 2 = 537.5284. We also 
obtain from Table 2. IB F* = 2.23 and P* = 0.2282. 

Using S 2 = 150.7728 from Table 2.1 A and 5* 2 = 577.9133 from Table 2.1B, 
all other values needed for the test can be computed. With y = 0.7828 we obtain 
Z = 2.8768, and finally P** = 0.03. This result is in good agreement with the P 
value of 0.0336 obtained from the combined analysis using SAS PROC MIXED 
with the REML option (see Table 2.1 A). □ 
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Table 2.1 A Data, Intrablock, and Combined Analysis for BIB Design 
(t = 6,b = 10,r = 5,k = 3, LAMBDA=2) 


options nodate pageno=l ； 
data rabbitl ； 
input B T Y @@； 
datalines ； 

1 6 42.2 1 2 32.6 1 3 35.2 

2 3 40.9 2 1 40.1 2 2 38.1 

33 34.6 3 6 34.3 3 4 37.5 
4144.94540.84343.9 
55 32.0 5 3 40.9 5 4 37.3 

6 2 37.3 6 6 42.8 6 5 40.5 

7 4 37.9 7 1 45.2 7 2 40.6 

81 44.0 8 5 38.5 8 6 51.9 
94 27.5 9 2 30.6 9 5 20.6 

10 6 41.7 10 4 42.3 10 1 37.3 
/ 

run ； 

proc print data=rabbitl; 

titlel 'TABLE 2.1A'; 

title2 'DATA FOR BIB DESIGN'; 

title3 1 (t = 6, b=10, r=5, k=3, LAMBDA=2 )'； 

run ； 

proc glm data=rabbitl ； 
class B T ； 

model Y=B T/solution ； 

estimate 1 linear ' T -5 -3 -1 1 3 5/divisor=8 .3661; 
estimate 1 quad' T 5 -1 -4 -4 -1 5/divisor=9.1652 ； 
estimate 1 cubic' T -5 7 4 -4 -7 5/divisor=13.4164 ； 
estimate 1 quartic' T 1 -3 2 2 -3 l/divisor=5.2915; 
estimate 1 quintic' T -1 5 -10 10 -5 l/divisor=15.8745; 
title2 'INTRA-BLOCK ANALYSIS'; 
title3 'WITH ORTHONORMAL CONTRASTS'; 
run ； 

proc mixed data=rabbitl; 

class B T ； 

model Y=T/solution ； 

random B ； 

lsmeans T ； 

title2 'COMBINED 工 NTRA- AND 工 NTER-BLOCK ANALYSIS 1 ; 
title3 '(USING METHOD DESCRIBED IN SECTION 1.8)'; 
run ； 




SIGNIFICANCE TESTS 


83 


Table 2.1 A {Continued) 


Obs 

B 

T 

Y 

1 

1 

6 

42.2 

2 

1 

2 

32.6 

3 

1 

3 

35.2 

4 

2 

3 

40.9 

5 

2 

1 

40.1 

6 

2 

2 

38.1 

7 

3 

3 

34.6 

8 

3 

6 

34.3 

9 

3 

4 

37.5 

10 

4 

1 

44.9 

11 

4 

5 

40.8 

12 

4 

3 

43.9 

13 

5 

5 

32.0 

14 

5 

3 

40.9 

15 

5 

4 

37.3 

16 

6 

2 

37.3 

17 

6 

6 

42.8 

18 

6 

5 

40.5 

19 

7 

4 

37.9 

20 

7 

1 

45.2 

21 

7 

2 

40.6 

22 

8 

1 

44.0 

23 

8 

5 

38.5 

24 

8 

6 

51.9 

25 

9 

4 

27.5 

26 

9 

2 

30.6 

27 

9 

5 

20.6 

28 

10 

6 

41.7 

29 

10 

4 

42.3 

30 

10 

1 

37.3 


The GLM Procedure 


Class Level Information 


Class Levels Values 

B 10 123456789 10 

T 6 1 2 3 4 5 6 
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Table 2.1 A (Continued) 

Number of observations 30 

Dependent Variable : Y 


Sum of Mean 


Source 

DF 

Squares 

Square 

F Value 

Pr > F 

Model 

14 

889.113889 

63.508135 

6.32 

0.0005 

Error 

Corrected 

15 

150.772778 

10.051519 



Total 

29 

1039.886667 





R-Square Coeff Var Root MSE Y Mean 

0.855010 8.241975 3.170413 38.46667 


Source 

DF 

Type I SS 

Mean Square 

F Value 

Pr > F 

B 

9 

730.3866667 

81.1540741 

8.07 

0.0002 

T 

5 

158.7272222 

31.7454444 

3.16 

0.0382 

Source 

DF 

Type 工 II SS 

Mean Square 

F Value 

Pr > F 

B 

9 

595.7352222 

66.1928025 

6.59 

0.0008 

T 

5 

158.7272222 

31.7454444 

3.16 

0.0382 




Standard 



Parameter 


Estimate 

Error 

t Value 

Pr > |t| 

linear 


0.68326421 

1.58518760 

0.43 

0.6726 

quad 


2.35674071 

1.58519809 

1.49 

0.1578 

cubic 


3.14664639 

1.58520742 

1.99 

0.0657 

quartic 


4.74975590 

1.58520728 

3.00 

0.0090 

quintic 


1.09504761 

1.58520728 

0.69 

0.5002 




Standard 



Parameter 


Estimate 

Error 

t Value 

Pr > |t| 

Intercept 


42.61111111 

B 2.24182052 

19.01 

< . 0001 


B 1 -3.29722222 B 2.79604144 -1.18 0.2567 

0.83611111 B 2.79604144 0.30 0.7690 


B 


2 
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Table 2.1 A {Continued) 






Standard 



Parameter 


Estimate 


Error 

t Value Pr > |t| 

B 

3 

-5.10000000 

B 

2.69433295 

-1.89 

0.0778 

B 

4 

5.49722222 

B 

2.79604144 

1.97 

0.0681 

B 

5 

-0.99166667 

B 

2.79604144 

-0.35 

0.7278 

B 

6 

2.11111111 

B 

2.79604144 

0.76 

0.4619 

B 

7 

2.48055556 

B 

2.69433295 

0.92 

0.3718 

B 

8 

6.13055556 

B 

2.69433295 

2.28 

0.0380 

B 

9 

-10.77777778 

B 

2.79604144 

-3.85 

0.0016 

B 

10 

0.00000000 

B 




T 

1 

-3.30000000 

B 

2.24182052 

-1.47 

0.1617 

T 

2 

-5.04166667 

B 

2.24182052 

-2.25 

0.0400 

T 

3 

-2.90000000 

B 

2.24182052 

-1.29 

0.2154 

T 

4 

-3.23333333 

B 

2.24182052 

-1.44 

0.1698 

T 

5 

-8.52500000 

B 

2.24182052 

-3.80 

0.0017 

T 

6 

0.00000000 

B 




NOTE: The 

X'X 

matrix has been 

found to be 

singular, 

and 


a generalized inverse was used to solve the normal 
equations. Terms whose estimates are followed by the 
letter 'B' are not uniquely estimable. 


COMBINED 工 NTRA- AND INTERBLOCK ANALYSIS 
(USING METHOD DESCRIBED IN SECTION 1.8) 


The Mixed Procedure 
Model Information 


Data Set 

Dependent Variable 
Covariance Structure 
Estimation Method 
Residual Variance Method 


W0RK.RABBIT1 

Y 

Variance Components 

REML 

Profile 


Fixed Effects SE Method Model-Based 

Degrees of Freedom Method Containment 


Class Level Information 
Class Levels Values 
B 


T 


10 

6 


123456789 10 
1 2 3 4 5 6 
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Table 2.1 A (Continued) 

Dimensions 


Covariance Parameters 2 
Columns in X 7 
Columns in Z 10 
Subjects 1 
Max Obs Per Subject 30 
Observations Used 30 
Observations Not Used 0 
Total Observations 30 


Iteration History 


Iteration 

Evaluations 

-2 Res Log Like 

Criterion 

0 

1 

160.26213715 


1 

2 

150.36288495 

0.00010765 

2 

1 

150.35691057 

0.00000054 

3 

1 

150.35688183 

0.00000000 


Convergence criteria met. 



Covariance 

Parameter Estimates 



Cov Parm 

Estimate 



B 

21.6953 



Residual 

10.0840 



Fit 

Statistics 



-2 Res Log Likelihood 150.4 
AIC (smaller is better) 154.4 
AICC (smaller is better) 154.9 
BIC (smaller is better) 155.0 


Solution for Fixed Effects 
Standard 

Effect T Estimate Error DF t Value Pr > |t| 

Intercept 42.3454 2.1303 9 19.88 <.0001 

T 1 -2.8100 2.2087 15 -1.27 0.2227 




SIGNIFICANCE TESTS 


87 


Table 2.1 A {Continued) 

Solution for Fixed Effects 


Effect 

T 

Estimate 

Standard 

Error 

DF 

t Value 

Pr > |t| 

T 

2 

-5.3172 

2.2087 

15 

-2.41 

0.0294 

T 

3 

-2.9941 

2.2087 

15 

-1.36 

0.1953 

T 

4 

-3.6952 

2.2087 

15 

-1.67 

0.1150 

T 

5 

-8.4560 

2.2087 

15 

-3.83 

0.0016 

T 

6 

0 







Type 3 

Tests of 

Fixed 

Effects 



Effect 

Num DF 

Den DF 

F Value 

Pr > F 


T 5 15 3.28 0.0336 


Least Squares Means 


Effect 

T 

Estimate 

Standard 

Error 

DF 

t Value 

Pr > |t| 

T 

1 

39.5354 

2.1303 

15 

18.56 

<.0001 

T 

2 

37.0282 

2.1303 

15 

17.38 

< . 0001 

T 

3 

39.3513 

2.1303 

15 

18.47 

<.0001 

T 

4 

38.6502 

2.1303 

15 

18.14 

< . 0001 

T 

5 

33.8894 

2.1303 

15 

15.91 

<.0001 

T 

6 

42.3454 

2.1303 

15 

19.88 

< . 0001 


Table 2. IB Data of Block Totals and Interblock Analysis 


options nodate pageno=l ； 
data rabbit2 ； 

input y xl x2 x3 x4 x5 x6; 
datalines ； 

110.0 011001 

119.1 111000 

106.4 001101 

129.6 101010 

110.2 001110 

120.6 010011 

123.7 110100 

134.4 100011 
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Table 2.IB (Continued) 

78.7 010110 
121.3 100101 
/ 

run ； 

proc print data=rabbit2; 

titlel 'TABLE 2.1 B'; 

title2 'DATA OF BLOCK TOTALS'; 


proc glm data=rabbit2 ； 
model y=xl x2 x3 x4 x5 x6; 
title2 'INTER-BLOCK ANALYSIS'; 
run ； 


Obs 


y 


xl x2 x3 x4 


x5 x6 


1 110.0 0 

2 119.1 1 

3 106.4 0 

4 129.6 1 

5 110.2 0 

6 120.6 0 

7 123.7 1 

8 134.4 1 

9 78.7 0 

10 121.3 1 


110 
110 
Oil 
0 10 

Oil 
10 0 

10 1 

0 0 0 

10 1 

0 0 1 


0 1 
0 0 
0 1 
1 0 
1 0 
1 1 
0 0 
1 1 
1 0 
0 1 


INTERBLOCK ANALYSIS 


The GLM Procedure 


Number of observations 10 


The GLM Procedure 


Dependent Variable : y 


Source 


DF 


Sum of Mean 

Squares Square F Value Pr > F 


Model 

Error 


5 1613.246667 322.649333 2.23 0.2282 

4 577.913333 144.478333 


Corrected Total 9 2191.160000 
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Table 2.IB {Continued) 


R-Square Coeff Var Root MSE y Mean 

0.736252 10.41587 12.01991 115.4000 


Source 

DF 

Type 工 SS 

Mean Square 

F Value 

Pr > F 

xl 

1 

1044.484000 

1044.484000 

7.23 

0.0547 

x2 

1 

89.792667 

89.792667 

0.62 

0.4746 

x3 

1 

10.454444 

10.454444 

0.07 

0.8012 

x4 

1 

407.075556 

407.075556 

2.82 

0.1685 

x5 

1 

61.440000 

61.440000 

0.43 

0.5499 

x6 

0 

0.000000 








Standard 



Parameter 

Estimate 

Error 

t Value 

Pr > |t| 

Intercept 

131.1000000 

B 

19.38152901 

6.76 

0.0025 

xl 

11.8000000 

B 

9.81421871 

1.20 

0.2955 

x2 

-13.5333333 

B 

9.81421871 

-1.38 

0.2400 

x3 

-5.8000000 

B 

9.81421871 

-0.59 

0.5863 

x4 

-17.4666667 

B 

9.81421871 

-1.78 

0.1497 

x5 

-6.4000000 

B 

9.81421871 

-0.65 

0.5499 

x6 

0.0000000 

B 





2.7 SOME SPECIAL ARRANGEMENTS 

In some cases it is possible to arrange the treatments in the blocks of a BIB 
design in a special way that may lead to a reduction of either or cr^ or both. 
These arrangements lead therefore to more efficient designs. 

2.7.1 Replication Groups Across Blocks 

Designs of this type are characterized by the fact that the first s positions in the 
b blocks form a group of / replications of all t treatments, the same being true 
for the next s positions, and so forth. If there are s f such groups, then s? = r. 
We illustrate this type of arrangement in Example 2.3. 

Example 2.3 Consider the BIB design with parameters t = 10, r = 6, b = 
15, k = 4, X = 2 given by the following plan: 
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Replication Group 


Block I II 


1 

1 

2 

3 

2 

1 

5 

2 

3 

1 

3 

7 

4 

4 

9 

1 

5 

5 

7 

1 

6 

6 

8 

1 

7 

3 

9 

2 

8 

2 

4 

7 

9 

5 

10 

2 

10 

2 

7 

8 

11 

3 

10 

5 

12 

6 

10 

3 

13 

4 

8 

3 

14 

6 

7 

4 

15 

8 

9 

4 


4 
6 
8 

10 

9 

10 
6 
10 
8 
9 
9 
7 

5 

5 

6 


It can be verified easily that for this design s = 2, s f = 2, / = 3. □ 

The analysis is based on the following model: 

y = fiO + X r x + + Xyy + e (2.23) 


yi jt = M D jt 

where r，and Pj are as defined previously and yt is the (fixed) effect of the €th 
replication group (£ = 1,2,, s f ). Since the replication groups are orthogonal to 
blocks and to treatments, the procedure for estimating linear contrasts of treatment 
effects is the same as before. More specifically, the RNE (2.9) are unchanged 
and the combined estimators are obtained by using (2.17). 

The only change occurs with regard to the estimation of This is reflected 
in the analysis of variance according to model (2.23) as given in Table 2.2. The 
form of this analysis of variance table shows a certain similarity to that of the 
Latin square design except that treatments and blocks are not orthogonal. But 
having sources of variation due to blocks and replication groups orthogonal to 
blocks achieves elimination of heterogeneity in two directions and hence leads 
to a reduction of the residual variance. 

As can be seen from Example 2.3 and model (2.23), the replication groups 
play the same role as the columns in a Latin square type of design. It is for this 
reason that such designs have also been referred to as Latinized incomplete block 
designs (Harshbarger and Davis, 1952; John and Williams, 1995). 
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Table 2.2 ANOVA for BIB Design with Replication 
Groups Across Blocks 


Source 

d.f. 

SS 

Xp\3,X y 

b- 1 

1 V- 2 G 2 

j 

Xy\0,X^X r 

/-I 

1 V- 7 G 2 

— Ri - 

r't ^ n 

x T \o,x^,x y 

t - 1 

i 

i\d,x^x t x t 

n — t — b — s f -\-2 

Subtraction 

Total 

n — 1 



Ri = total of ith replicate. 


As a special case of designs described in this section we mention the following 
definition. 

Definition 2.3 A BIB design that can be arranged in replication groups across 
blocks, each group containing a complete replicate (i.e., / = l, s f = r = k), is 
called a Youden square design (see 1.10.5). 

These designs, which for obvious reasons are also called incomplete Latin 
square designs ， were first introduced by Youden (1937, 1940). They overcome 
the most severe objection to Latin square designs, namely that the number of 
replications (of each treatment) is the same as the number of treatments, t, which 
for large t is difficult to achieve from a practical point of view. Also, the block 
size in a Latin square design will then usually become too large. 

It was shown by Hartley and Smith (1948) that for all incomplete block designs 
in which the number of treatments is equal to the number of blocks, such arrange¬ 
ments exist. Their proof provides, in fact, a general procedure of constructing a 
Youden square from a given symmetrical BIB design. This is accomplished by 
repeatedly interchanging the positions of pairs of treatments in the same block. 
A list of Youden squares is given by Cochran and Cox (1957, 1992). □ 

2.7.2 Grouped Blocks 

Designs of this type are characterized by the fact that the first Z/ blocks form a 
group of a replicates of the t treatments, so do the next b' blocks, and so on, 
these groups of blocks being thereby orthogonal to treatments. If there are q of 
such groups, then qb’ = b and qa = r. 
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Following Shrikhande and Raghavarao (1963) we formalize this in the fol¬ 
lowing definition. 

Definition 2.4 A BIB design with parameters t ， b ， k ， r，"k that can be arranged 
in groups of blocks, each group containing a replicates of the t treatments, is 
said to be a-resolvable. 

This concept of a-resolvability is an extension of the concept of resolvability 
given by Bose (1942), which refers to the case a = l. 

The following design is an example of a 3-resolvable design. □ 

Example 2.4 Consider the following design with parameters t = t, b = 10, 
A ： = 3, 4 = 6, A. = 3: 


Replication 

Block Treatment Group 

1 12 3 

2 1 2 5 

3 14 5 I 

4 2 3 4 

5 3 4 5 

6 1 2 4 

7 13 4 

8 13 5 II 

9 2 3 5 

10 2 4 5 


We find b’ = 5, q = 2, a = 3. □ 

For a-resolvable BIB designs inequality (2.6) can be improved as follows. 

Theorem 2.2 If a BIB design with parameters t, b, k, r, X is a-resolvable, 
then the following inequality must hold: 

b t ci — 1 (2.24) 

Proof Since for an a-resolvable BIB design there exist q — I linearly inde¬ 
pendent relationships between the columns of N ， we have 

t = rank(iV^V’）= rank(A^) < b — q l 


which implies (2.24). 


□ 



SOME SPECIAL ARRANGEMENTS 


93 


Definition 2.5 An a-resolvable BIB design is called affine a-resolvable if 
any pair of blocks in the same replication group have q\ treatments in common 
and if any pair of blocks from two different replication groups have 的 treatments 
in common. □ 

For affine a-resolvable BIB designs the following results are due to Shrikhande 
and Raghavarao (1963). 

Theorem 2.3 In an affine a-resolvable BIB design k 2 /t is an integer. 

Proof Without loss of generality consider the first block in the first replication 
group, B\\ say. By definition, B\\ has q 2 treatments in common with every block 
in the remaining q — I replication groups, each group consisting of b/q blocks. 
Hence 

b 

(q -l) - q 2 = k{r - a) 

q 

which, since aq = r, yields q 2 = k 2 /t. Since q 2 is an integer, k 2 /t is an integer. 

Theorem 2.4 For an affine a-resolvable BIB design, the equality 

h = t q — l (2.25) 

holds. 


Proof We shall give here the proof for a = 1 following Bose (1942) and 
refer to Shrikhande and Raghavarao (1963) for the general case. 

Consider a resolvable BIB design. We then have 

t = b，k b = b f r (2.26) 

since the b blocks are divisible into r sets of b r blocks each, each set containing 
each treatment exactly once. Let the blocks belonging to the ith set, Si , be denoted 
by Bn, B 。， ..., Buy for i = 1, 2,..., r. Consider now a particular block, B\\ 
say. Let iij be the number of treatments common to blocks B\\ and B"(i = 
2, ... ， r; j = 1 ， 2, … ， b’）. Further, let m denote the average and s 2 the variance 
of the b\r — 1) numbers t". 

Now each of the k treatments in B\\ is replicated r times, and since the 
design is resolvable, that is, 1-resolvable, a given treatment in B\\ will occur in 
exactly one block each in the r — 1 other sets, S 2 , S 3 ,..., S r . This is true for all 
k treatments in B\\, and hence 


= Hr - 1) (2.27) 

ij 
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and therefore 


k k 1 


(2.28) 


using (2.26) for the last expression. 

Further, the k(k — 1)/2 pairs of treatments in B\\ each occur 入 一 1 times 
together in the sets S 2 , S 3 ,, S r . Hence 


工 w 1 ) = ^( 入-聯 - 1 ) 


and therefore, using (2.27) 




Since 


we rewrite (2.29) as 


E- 


r(k - 1) _ r(k - 1) 
t - 1 = b'k- 1 


_ ， k- \){r - k) + r{k - l) 2 ] 
b'k - 1 


Then we find after some algebra, using (2.28), (2.29), and (2.26), 

公 £" - m ) 2 
2 _ ji _ 

b f (r — 1) 


E4 


b\r - 1) 


m 


(2.29) 


k(t-k)(b-t-r+\) 
b /2 (r - 1 )( 卜 1)~ 


(2.30) 


Since s 2 > 0, (2.30) implies, of course, the earlier result (2.24), with q = r for 
a = l. 


We now consider an affine resolvable BIB design. Then, by definition, tij = q 2 
for all i, j. Hence 夕 2 of (2.30) equals zero, which implies (2.25) with q = r for 
a 1-resolvable BIB design. □ 
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Corollary 2.1 If the parameters of a BIB design satisfy (2.25) and k 2 /t is 
not an integer, then the design is not resolvable. 

Example 2.5 An example of an affine resolvable BIB design is given by 
the following design with parameters t = S,b = 14, k = 4, r = 1, X = 3, = 

2 , q = 1 , q\ = 0, q 2 = 2 , a = l : 

Replication 

Block Treatments Group 

1 12 3 4 

I 

2 5 6 7 8 

3 12 7 8 

II 

4 3 4 5 6 

5 13 6 8 

III 

6 2 4 5 7 

7 14 6 7 

IV 

8 2 3 5 8 

9 12 5 6 

V 

10 3 4 7 8 

11 13 5 7 

VI 

12 2 4 6 8 

13 14 5 8 

VII 

14 2 3 6 7 

A natural model for the analysis of such a design is 

j = /x3 + Z r r + + + e (2.31) 

or 

yiji = fi + Ti + r]j + + eijl 

where r]j is the (fixed) effect of the jth replication group (j = 1 , 2 ,... ,q) and 
is the effect of the Ith block in the jth group (i = 1,2,..., b f ). This brings 
out the point that the blocks are nested in the replication groups and, hence, in the 
case of the ❿ being random variables (as assumed for the combined analysis), 
that cr^ now measures the variability of the blocks within groups. Since b r < b. 



96 


BALANCED INCOMPLETE BLOCK DESIGNS 


We find that r = 5, s = 2, a = 2, q = 5 in the notation of Sections 2.7.1 
and 2.7.2. □ 

A natural model for this type of design is 

y = fiO + X r t + Xyy + + X b* P* + e (2.32) 


it can then usually be concluded that for this design will be smaller than the 
corresponding cr^ in the BIB design without grouping of blocks. Consequently, 
only the part of the analysis of variance that deals with the estimation of aj* 
(to be used for the combined analysis, in particular, using the Yates estimator) 
is affected when using this design and model (2.31). This is shown in Table 2.3, 
where Rj refers to the total of the jth replication group, and Bji refers to the 
total of the £th block within the jth replication group. □ 

2.7.3 a-Resolvable BIB Designs with Replication Groups Across Blocks 

The designs of this type combine the properties of the designs discussed in 
Sections 2.7.1 and 2.7.2 and hence lead to a possible reduction in both of and 

Example 2.6 As an example we consider the following BIB design with 
parameters t = 6, b = 15, k = 4, r = 10, X = 6: 


Block 


Replication 

Group 


Z? = 15, A ： = 4, r = 10, A = 6: 


Replication Groups 
Across Blocks 



I 



II 

1 


2 

3 

4 

1 


5 

4 

6 

2 


3 

5 

6 

1 


2 

3 

5 

1 


2 

4 

6 

3 


5 

4 

6 

1 


3 

2 

6 

4 


5 

1 

3 

4 


6 

2 

5 

2 


4 

1 

5 

5 


6 

1 

3 

3 


6 

2 

4 


n mIVv 


4 


4 


0 12 3 4 5 
11 11 11 11 11 11 
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Table 2.3 Analysis of Variance for Resolvable BIB Design 


Source 

d.f. 

SS 

E(US) 

x z \0,x v 

t - 1 

卜 

i 

G 2 

n 


分 -1 

j 

G 2 

n 

X^\3,X x ,X v 

b — q 

rv 坪 2 , n — t — Kq _ i) 2 

Ditterence err H - ctd* 

b - q p 


n — t — b \ 

ijH i 



j 見 


Total 

n — 1 

ijf- 

G 2 

n 


or 


yijtu = M + A + Yj ~\r Pi u H - ^ijlu 

where all the parameters are as previously defined. 

The analysis of variance associated with (2.32) follows easily from those 
presented in Tables 2.2 and 2.3 with the partitioning of the total d.f. as given in 
Table 2.4. 


Table 2.4 Outline of Analysis of Variance 
for Model (2.32) 


Source 

d.f. 


t- 1 

XylO^X^Xrj 

〆 一 1 

X^^X^Xy 

Q - 1 


b — q 

I\3,X T ,Xy,X^X^ 

n — t — b — s r -\-2 

Total 

n — 1 
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2.8 RESISTANT AND SUSCEPTIBLE BIB DESIGNS 
2.8.1 Variance-Balanced Designs 

The class of BIB designs belongs to the class of variance-balanced designs, that 
is, designs for which every normalized estimable linear function of the treatment 
effects is estimated with the same variance. It is easy to establish the following 
theorem. 


Theorem 2.5 A necessary and sufficient condition for a connected design to 
be variance balanced is that its C matrix is of the form 


C = Cl I+ c 2 03' 




then 


oco 


<D 0^ 

,0 0； 


where D = diag ( 必 ) .Then the RNE 


can be written as 


Cr= Q 

OCOOr = O' Q 


(2.33) 


Proof For an IBD with rank C = t — 1 let H … ， U, = 
be a set of orthonormal eigenvectors of C. If we write 


Hence 



： )(T)=(T) 


Q\t = D 1 OiQ 


with 


varCO^r) = D l 0\CO x D l 
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For a variance-balanced design we must then have D = dl. It then follows 
that 


C = V 

\o / 0/ 


= dO x O\ 




[see (1.17)] which we can write as 


C = c\I -\- C233 / 

which is (2.33). Obviously, if C is of the form (2.33), the design is variance 
balanced. □ 


Further, among all binary, equireplicate, proper designs the BIB designs are 
the only variance-balanced designs. There do exist, however, variance-balanced 
designs with unequal block sizes, for example. 


2.8.2 Definition of Resistant Designs 

An interesting question then is: If from a BIB design one or several treatments are 
deleted, is the resulting design still variance balanced? In general the answer will 
be negative, but for some designs the answer will be, more or less, positive. Such 
designs have been called resistant BIB designs by Hedayat and John (1974) who 
have also given a characterization of such designs and have shown their existence. 
We shall give some results. 

Let D be a BIB design with parameters t, b, k, r, X and let D* be the design 
obtained from D by deleting all experimental units that received a certain subset 
L of v(< t — 2) treatments. Following Hedayat and John (1974) we then consider 
the following definitions. 

Definition 2.6 The design D is said to be globally resistant of degree v if 
Z)* is variance balanced when any subset L of v treatments is deleted. □ 


Definition 2.7 The design D is said to be locally resistant of degree v if D* 
is variance balanced only with respect to certain subsets L of v treatments. □ 

Definition 2.8 The design D is said to be susceptible if there exist no subsets 
L such that D* is variance balanced. □ 
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2.8.3 Characterization of Resistant Designs 

Of particular interest, from a practical point of view, are resistant designs (locally 
and globally) of degree 1. It may happen that the researcher feels that a particular 
treatment, say a drug, may have to be discontinued before the termination of the 
experiment. If a BIB design had been chosen for this experiment to achieve 
equal precision for treatment comparisons, then such a course of action might 
destroy this feature, unless the BIB design is globally resistant of degree 1 or 
locally resistant of degree 1 with respect to treatment 0 , say, and the treatment 
in question has been assigned the number 0 . 

To characterize suitable designs, let D denote a BIB design with parameters t, 
b ， k, r, k and let D be divided into two parts Dq and D 百 . Design Dq consists of 
all the blocks of D that contain the treatment 0, and Dq consists of the remaining 
blocks of D that do not contain the treatment 0. If N, N$, Nq are the incidence 
matrices of D, Dq ， and D 百 , respectively, then 

N = (N e ， N$ (2.34) 

with 

NN r = N e N' e + N w N f w = (r - X)I + X03 r 

Now suppose that treatment 0 has been deleted from D, that is, from Dq. Call 
the resulting designs D* and respectively, and their corresponding incidence 
matrices N* and Nq. Also, let be the incidence matrix obtained from 
in (2.34) by deleting the row (of zeros) corresponding to 0, and call the corre¬ 
sponding design D^. We then have the following theorem due to Hedayat and 
John (1974). 

Theorem 2.6 The BIB design D is locally resistant with respect to 0 if and 
only if D-q is a BIB design. 

Proof Let C* denote the C matrix for D*. Then, with 

， = (N；, Np 

a (t — l) x b incidence matrix, we have 

c*= rl t - x - YTjN* e N* g ' - \n*^ (2.35) 

Since Dq and are derived from the BIB design D ， we know that if the general 
(off-diagonal) element in is k”, the corresponding element in NqNq must 

be 入一 乂小 Hence, the diagonal and off-diagonal elements of C* are 



RESISTANT AND SUSCEPTIBLE BIB DESIGNS 


101 


Hence C of (2.35) is of the form (2.33) if and only if all Xfj are equal, that is, if 
Dj and hence Dq is a BIB design (with parameters t\ = t — l, b\ = b — r,k\ = 
k, r\ = r — 入，入 i). □ 

We illustrate this theorem with the following example. 

Example 2.7 The following plan, D, is a BIB design with parameters t = 
S,b = 14, A: = 4, r = 7, A, = 3: 

12 3 4 

5 6 7 8 

12 7 8 

3 4 5 6 

13 6 8 

2 4 5 7 

14 6 7 

2 3 5 8 

12 5 6 

3 4 7 8 

13 5 7 

2 4 6 8 

14 5 8 

2 3 6 7 

Let 0 = 1; then D-r is given by 








入 
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This is a BIB design with t\ =l,b\ = 7, k\ = 4,r\ = 4, = 2. Hence D is 

resistant w.r.t. treatment 1. 

The design D* obtained from design D in Example 2.7 provides an illustration 
of the construction of an equireplicate, variance-balanced incomplete block design 
for t* = 7 treatments with unequal block sizes. In particular, we have here b* = 
14 blocks, with 7 blocks of size k* = 3 and 7 blocks of size = 4. □ 

Corollary 2.2 The BIB design D is globally resistant of degree 1 if and 
only if Dq is a BIB design for all 6 of D. 

Corollary 2.3 The property of being a resistant BIB design depends not only 
on the parameters of D but also in the way D has been constructed. 


An interesting example of this is provided by Hedayat and John (1974). 

Corollary 2.4 The BIB design D is locally resistant w.r.t. 0 if and only if 
every triple containing 0 appears the same number of times in D. 

Proof From the proof of Theorem 2.6 it follows that if Dj is a BIB design, 
then also Dq is a BIB design, that is, every pair of treatments (other than 0) 
appears together the same number of times. Hence in Do (and therefore in D) 
every triple containing 0 appears together the same number of times. □ 


This leads one to another characterization of globally resistant designs of 
degree 1. Before we state this we give the following definition. 

Definition 2.9 An incomplete block design in which each triple of treatments 
occurs together the same number of times, 5, is referred to as a doubly balanced 
incomplete block (DBIB) design. 

This definition is due to Calvin (1954), and such designs and their properties 
have been discussed by Calvin (1954) and Raghavarao and Thartare (1967, 1970). 
The design given in Example 2.7 is a DBIB design with <5=1. □ 

Based on Definition 2.9 and the proof of Corollary 2.4 we then have the 
following theorem. 

Theorem 2.7 A BIB design D is globally resistant of degree 1 if and only 
if it is a DBIB design. 

Existence, construction, and properties of resistant BIB designs have been 
discussed by Hedayat and John (1974) and John (1976), and we shall not go into 
this, except for the following theorem. 

Theorem 2.8 Every symmetric BIB design with blocks of size k is locally 
resistant of degree k. 
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The proof follows immediately from the construction procedure of BIB designs 
described in Section 3.2. 

An interesting follow-up to Theorem 2.8 is provided by Baksalary and Puri 
(1990). They show that the symmetry of the design is both necessary and suf¬ 
ficient for the local resistance of degree k with respect to L consisting of k 
treatments that occur in exactly one block of the design if we demand D* to be 
a BIB design (rather than just a variance-balanced design). 

2.8.4 Robustness and Connectedness 

Another form of robustness of BIB designs is concerned with the question to what 
extent the unavailability of observations, such as missing or lost observations, 
affects the property of connectedness of the BIB design. 

Following Ghosh (1982) we give the following definition. 

Definition 2.10 A BIB design is said to be robust against unavailability of 
any s observations if the block design obtained by omitting any s observations 
remains a connected design. □ 

Using the notion of a bipartite graph, consisting of the subsets T = 
(1,2,..., t) and ® = (1, 2, .. b), and the notion of connectedness (see 
Section 1.5) Ghosh (1982) then proves the following theorems. 

Theorem 2.9 A BIB design is robust against the unavailability of any 5 1 < 
r — 1 observations. 

Theorem 2.10 A BIB design is robust against the availability of all obser¬ 
vations in bo < r — l blocks. 

With regard to Theorem 2.10, of particular interest is the case bo = 1. This 
case is considered by Srivastava, Gupta, and Dey (1990) who show that the 
efficiency factor for the resulting design relative to the original BIB design is 
given by 

E = (Xt - k)(t - l)/(Xt 2 -Xt -tk + k 2 ) 

Considering many of the existing BIB designs, they found that for most designs 
E > .80. 



CHAPTER 3 


Construction of Balanced 
Incomplete Block Designs 


3.1 INTRODUCTION 

In Chapter 2 we discussed at great length the nature of BIB designs and their 
associated analyses. Although extensive lists of actual plans of such designs are 
available (see Cochran and Cox, 1957; Fisher and Yates, 1957; Beyer, 1991), 
these lists do not cover all presently existing BIB designs. Also, it is desirable 
to have some understanding of the combinatorics of design and of the result¬ 
ing algebras. The actual structure has, naturally, an impact on the analysis of 
resulting data. 

Several methods of constructing BIB designs have been introduced (for refer¬ 
ences, see Raghavarao, 1971) using such mathematical tools as finite Euclidian 
and projective geometries. We shall present here a very powerful and at the same 
time a quite simple method, the method of cyclical development of initial blocks, 
which is due to Bose (1939). With this method one can construct most of the 
existing BIB designs (Rao, 1946b) as one can see from the essentially complete 
listing of BIB designs given by Raghavarao (1971). Mainly from a historical 
point we shall also discuss some other methods. 


3.2 DIFFERENCE METHODS 

3.2.1 Cyclic Development of Difference Sets 

The basic ideas of this method of constructing BIB designs are as follows: 

1. We have a set of t treatments that we denote by T = {^o, t\,, t t -\}. 

Design and Analysis of Experiments. Volume 2: Advanced Experimental Design 

By Klaus Hinkelmann and Oscar Kempthorne 

ISBN 0-471-55177-5 Copyright © 2005 John Wiley & Sons, Inc. 
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2. We have a set of s initial blocks, each containing k treatments. Suppose 
we denote these initial blocks by 

= (tj\i tj2-> • • • 5 ^jk) (j = 1 ， 2， .. • ， s) 

where the tji (i = 1,2,... ,k) are elements in T. The are chosen in 
a particular way such that they satisfy certain properties (which we shall 
discuss below). 

3. Each initial block is then developed cyclically, that is, by forming 

^>je = (tj\+te ， tj2 te,, tjk + te) 

for ^ = 1 , 2 ,..., ^ — 1 assuming that we have an addition rule such that 
with tji, to e 7 also tji + to g T(^ = 0 , 1 ,..., ^ — 1 ) and tji + 吣 = tji. 

4. The set of blocks {^>je\ j = 1 ， 2,.. • ， 5 1 ; 沒 = 0, 1,.. •, ， 一 1} forms then a 
BIB design with parameters t,b = st, k,r = sk ， "k. 


The crucial part of this method is obviously the choice of T and the initial 
blocks !Byo (j = 1,2,..., s). To discuss their properties we shall confine our¬ 
selves at first to the case s = l, that is, one initial block, say. Extensions to 
other situations will then become fairly obvious. 

We start with the following definition. 

Definition 3.1 Let S = 0, 1 ， ...，，一 1 be an Abelian group under addition, 
that is, for gi, g 2 ^ 5 define g\ + g 2 to be g\ + g 2 mod t and define gi — g 2 to 
be g, where g2 8 = g\ mod t. Let >1 be a subset (of 9) of k elements such that 
the k(k — 1) differences (mod t) between members of A comprise all nonzero 
elements of S exactly X times. Then A is called a difference set of size k. □ 

Example 3.1 Let ? = 7, S = {0, 1, 2, 3, 4, 5, 6}, A = {0, 1, 3}. The differ¬ 
ences between members in A are 0—1 三 6,0 — 3 三 4,1 — 0 三 1,1 — 3 三 
—2 = 5,3 — 0 = 3,3 — 1=2, and hence A, = 1. □ 

Taking T = S, we now state the main result of this chapter in the form of the 
following theorem. 

Theorem 3.1 Let /I be a difference set of size k for a group S of ? elements 
such that each nonzero difference occurs 入 times. If the set A is taken as the 
initial block and developed cyclically, it generates a symmetrical BIB design 
with parameters t，b = t，r = k ， "k. 

Proof Let A = T>o = [gi, g2, • ^ , gk} with 沿 g 3(/ = 1,2, cycli¬ 

cally developing ®o, each element in S occurs exactly once in each of the k posi¬ 
tions; hence each element occurs in the b sets (blocks) = 0 , 1 ,..., ^ — 1 ) 
exactly k times. The only property of a BIB design that needs to be shown then 
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to hold is that any two elements (treatments) occur together in 入 blocks. To see 
this, consider without loss of generality g u , g v , e S. Suppose that for g u ，, g v f e A 
we have 


gu = gu r + 0 
gv = gv r + 0 

that is, g u and g v occur together in "Bq. Then 


(3.1) 


or 


§u Su' = §v ~ §v r 
§u ~ §v = Su r 8v r 


Now g u — g v = c with c e [1,2,..., t — 1}. Since A is a. difference set, we 
know that there exist 入 differences of the form g u ，一 g v ， with the same constant 
c. Hence there exist exactly 入 solutions to (3.1) as equations in 0. Call these 
solutions H 0 人 .It then follows that g u and g v appear together in blocks 

..., which proves the theorem. □ 

As an illustration reconsider the following example. 

Example 3.2 Let ^ = 7, S = {0, 1, 2, 3, 4, 5, 6,}, A, = ®o = {0, 1, 3}, A. = 1. 
The BIB design then is 


(0 ， 1, 3) 
(1 ， 2, 4) 
(2, 3, 5) 
(3, 4, 6) 
(4, 5, 0) 
(5, 6 ， 1) 
( 6 , 0 , 2 ) 


where each row represents a block. 


□ 


It is now easy to generalize this procedure to the case of several initial blocks. 
We do this in the next theorem. 


Theorem 3.2 Let A = {®io, ®20,. •., ®^o} consist of 5- sets of k elements 
each such that among the sk{k — 1) differences between the elements in each set 
each nonzero element of S occurs exactly X times. Developing each set cyclically 
yields a BIB design with parameters t,b = st ， k，r = sk ， k. 

The proof follows along the same lines as that of Theorem 3.1. 

To illustrate Theorem 3.2 we consider the following example. 
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We note that this design could be used as a Youden square with 5 replication 
groups of 11 blocks each (see Section 2.6.1) □ 

3.2.2 Method of Symmetrically Repeated Differences 

So far the procedure of cyclic development has led to BIB designs with the 
number of blocks being a multiple of the number of treatments. As exemplified 
in Example 3.3, this can mean a rather large number of blocks. This number can 


Example 3.3 Let t = 11, y = {0, 1,..., 10}, and 

®io = (0, 1, 10) 

®20 = (0, 2, 9) 

®30 = (0, 4, 7) 

®4o = (0, 8, 3) 

®50 = (0, 5, 6) 

The differences resulting from 

®io are 10, 1, 1,2, 10,9 
®20 are 9, 2, 2, 4, 9, 7 
® 30 are 7, 4, 4, 8, 7, 3 
® 4 o are 3, 8, 8, 5, 3, 6 
® 5 o are 6, 5, 5, 10, 6, 1 

Inspection shows that 入 = 3 and hence A = {®io, • • •, ®5o} satisfies the condi¬ 
tion of Theorem 3.2. The resulting BIB design with 11 treatments in 55 blocks 
of size 3 and 15 replicates per treatment is then as follows: 


\~\ \~\ \~\ \~ / \ — \ \~~\ \~\ 、 — s \ — \ \~y 

67890012345 

5, 6, 7, 8, oC o, o, 1, 2, 3, 4, 

11 


(0,(1,(2,(3,(4, 


5, 6, 


5 0 


7 8 9 1 

/V /V /lx /V. 


\—'/ \—'\ \—'\ \—\ \—'\ \—'\ \—*/ \—'\ \—'\ \—*/ \ _ ^ 

34567890012 

11 

8 ， 9,o, o, 1 ， 2, 3, 4, 5, 6, 7, 


» o 


01234567891 

/IV /V- /IV /V- /IV /l' /IV /IV /V /(V /l\ 


\—/ \J- \)y 

78900123456 

11 

4, 5, 6, 7, 8,9,o, o, 1 ,2,3, 

11 

o’ 1 ， 2, 3, 4, 5,6 ,7, 8,9,1 o 

/—.\ /V /_\ /V /_\ V_\ /v V_\ /V V_\ /V 

\—/ \7 \7 \—/ \)/ \7 \—/ \—/ \—/ 

90012345678 

11 

2, 3, 4, 5, 6, 7, 8,9,o, o, 1 ， 

11 

o’ 1 ， 2, 3, 4, 5,6 ,7, 8 ， 9,1 o 

s...\ \^\ s...\ v\ v...\ x(v /V —\ /V x(\ \...\ 


\—/ \—/ \—/ \—/ \—/ \—/ \—/ \—/ \—/ \1/ \17 

00123456789 

11 

1 ， 2, 3, 4, 5, 6, 7, 8,9,o, o, 

11 

o’ 1，2 ,3, 4, 5,6,7’ 8,9,1 o 

/V /IV /IV /IV /IV /|\ /IV /IV /IV /IV /(\ 
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possibly be reduced, however, if the number t can be factored, say t = pq. For 
this situation the notion of a difference set and hence the construction procedure 
can be modified as follows (Bose, 1939). 

Consider a finite additive group S containing p elements, S = {a( 0 ), a^\ ..., 
say. To each element of the group S let there correspond q symbols, the 
symbols associated with a ⑻ being denoted by a^\ a^\ , a^\ Symbols with 

the subscript j are said to belong to the jth class, that is, a^\ a^\ ..., a” 
belong to the jth class. We consider differences of the form a; u ) - a?), which 
are called differences of type (/, j), and their value is given by a ⑻ — = d, 
say, where d e S- When i = j, the differences are called pure, when / ^ j the 
differences are called mixed, there being q types of pure differences and q(q — 1) 
types of mixed differences. 

We then have the following definition. 

Definition 3.2 Let A = { 怎⑴， ®20,..., be a collection of s sets satis¬ 
fying the following conditions: 

1. Each set contains k symbols with 


k = nu-\- U 2 i H - h n q £ = 1,2,... ,s) 

and riji denoting the number of symbols of the jth class in set i. 

2. Among the n u( n u — 1) pure differences of type (/, i) arising from 
the 夕 sets, every nonzero element of S is repeated exactly 入 times for each 
i — 1 , 2 , . •. ， q . 

3. Among the Y^t=\ n ii n jt mixed differences of type (/, j) arising from the s 

sets, every element of S is repeated exactly X times for every (/, j); i, j = 
1,2, _ j. 

If conditions 1-3 are satisfied, the differences are said to be symmetrically 
repeated in S, with each difference occurring X times. □ 

To illustrate Definition 3.2 we consider the following example. 


Example 3.4 

Let t = 

10 ， p = 5, (j = 

= 2,3 = 

{0,1, 2, 3, 4}, 

A : 

®10 

= (O 2 , I 2 , 22 ) 

®20 

=(1i,4i,0 2 ) 


®30 

= (2i,3i,0 2 ) 

®40 = (li,4i, 22 ) 


®50 

=(2i, 3i, 22 ) 

®60 

=(0i, O 2 , 22 ) 

that is, 





n\\ = 0 


n2i = 3 



n \2 = 2 


nn = 1 



^13 = 2 


^23 = 1 



n\4 = 2 


«24 = 1 



n\5 = 2 


^25 = 1 



n\6 = 1 


行 26 = 2 
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Pure differences of type (1, 1): 1 — 4 = —3 = 2 

^ 4-1= 3=3 

2- 3 = -1 =4 

3- 2= 1=1 

1 - 4= -3 = 2 

4- 1= 3=3 

2- 3=-1=4 

3- 2= 1 = 1 : A. = 2 


Pure differences of type (2, 2): 0 — 1 = —1 = 4 

‘ 0-2=-2=3 

1 - 0 = 1=1 

1 -2= -1 =4 
2 - 0 = 2=2 
2 - 1 = 1=1 
0-2=-2=3 

2 — 0 = 2 = 2 : A. = 2 


Mixed differences of type (1, 2): 1—0= 1 = 1 

4-0= 4=4 
2 - 0 = 2=2 

3- 0= 3=3 
1 -2 = -1 =4 

4- 2= 2=2 

2 - 2 = 0=0 
3-2= 1=1 

0 - 0 = 0=0 
0-2 =-2 三 3 :入 = 2 


Mixed differences of type (2, 1) are the negative values of the differences of type 
(1, 2): X = 2. Hence A satisfies conditions 1-3. □ 

Now we let the pq symbols (u = 1,2,..., p, j = 1,2,..., q) represent 
the t treatments. The method of cyclic development is then used again to generate 
BIB designs as described in the following theorem. 

Theorem 3.3 Let it be possible to find a collection of s sets A = {®io, ®20, 
,® 5 q} satisfying the following conditions: 


i. Among the ks symbols (treatments) occurring in the s sets (blocks), exactly 
r symbols belong to the jth class (j = 1 , 2 ,..., q), that is, ks = qr. 

ii. The differences in A are symmetrically repeated, each occurring 入 times. 
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Then, if each set in A is cyclically developed, obtaining by adding 0 e 
S to each element in "Bioil = 1, 2,... ，夕 ） and retaining the class number, the 
resulting ps sets = 1,2,..., s; 0 e 3) form a BIB design with parameters 
t = pq,b = ps, k,r = ks/q, X. 

The proof follows arguments similar to those used in the proof of Theorem 
3.1 (for more details see Raghavarao, 1971). 

We shall illustrate this procedure with a continuation of Example 3.4. 

Example 3.4 (Continued) Let t = 10 = 5 x 2, S = {0, 1, 2, 3, 4}. Develop¬ 
ing the initial blocks given earlier, we obtain the following BIB design with 
t = 10, b = 30, k = 3, r = 9, X = 2: 


(O 2 , I 2 , 22) 

(1i,4i,0 2 ) 

(2i, 3i, O 2 ) 

(I 2 , 22 , 32) 

(2i,0i,1 2 ) 

(3i, 4i, I 2 ) 

(22, 32, 42 ) 

(3i,1i,2 2 ) 

(4i, 0i, 22 ) 

(32, 42, O 2 ) 

(4i,2! ， 3 2 ) 

(0i, li, 32 ) 

(42, O 2 , I 2 ) 

(0i, 3i, 42) 

(1i,2i,4 2 ) 

(11,4!,2 2 ) 

(2i,3i,2 2 ) 

(0i, O 2 , 22 ) 

(2i, 0i, 32) 

(3i,4i,3 2 ) 

(li, I 2 , 32) 

(3i, li, 42) 

(4i,0i,4 2 ) 

(2i, 22, 42) 

(4i, 2i, O 2 ) 

(0i, li ， Oo) 

(3i, 32, O 2 ) 

(0i, 3i, I 2 ) 

(1i,2! ， 1 2 ) 

(4i, 42, I 2 ) 


We can then identify the symbols with the treatments in any way we want to, 
such as 

Symbol 0i O 2 1 1 I 2 2 之 3i 32 4i 4 〗 

Treatment 123456789 10 □ 

Further modifications of the procedures discussed so far can be achieved 
by considering the set of treatments Too = Ui ， Q ，... ， 00 } where S = Ui ， h ， 
...,t v ) represents an additive group and the element 00 is such that 00 + jc = oc 
for any x e Q. As an extension of Theorem 3.2 we then have the following 
theorem. 


Theorem 3.4 Let S be an additive Abelian group with v elements. Let A 
represent a collection of s u initial blocks A = {!Bio, ®20, •.. ， 

...,each of size k, subject to the following conditions: 

i. The blocks = 1,2,... ,s) contain k elements of S. 

ii. The blocks = 1,2,... ,u) contain oc and k — \ elements of 9- 
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m. Ignoring oo, the sk(k — 1) + u(k — \){k — 2) differences in A contain 
each nonzero member of S the same number, 入 ， of times. 

iv. X = u{k — 1). 

Then developing the blocks in A cyclically by means of the elements in 
S (mod v) yields a BIB design with parameters t = v+ l,b=(s-j- u)v, k,r = 
uv, and X = u(k — 1). 

We illustrate this theorem with the following example. 

Example 3.5 Let d = 11， Too = {0, 1， .. • ， 10, oo}, S = {0, 1,..., 10}, s = 
1, w = 1 ， and 

® 10 = (0 ， 1 ， 3,7,8, 10) 

= (oc, 0, 5, 6, 8, 10) 

The 30 + 20 = 50 differences yield each nonzero element of S five times, 
that is, A. = 5. The resulting BIB design with parameters t = 12, b = 22, k = 6, 
r=ll,A = 5is given by 


Theorem 3.4 can be generalized in much the same way that Theorem 3.2 
was generalized by defining pq symbols of the form (w = 1,2,p; j = 
1,2,... ,q) with the a ⑽ forming an additive group S. These pq symbols are 
now referred to as finite symbols to which we adjoin the symbol oo, so that 
Too = = 1,2,..., p; j = 1,2,..., q), oc}. We then have the following 

theorem. 

Theorem 3.5 Let A = {®io, ® 20 , … ，队 0 , ®’io, …， satisfy the 

following conditions: 

i. Each set = 1 ， 2, ... ，5 1 ) contains k different finite symbols. 

ii. Each set = 1,2,... ,u) contains 00 and (k — 1) different finite 

symbols. 


□ 


\—/ \—/ \—/ \7 \1/ \—/ \—/ \)/ 

00123456789 

IX 

8,9,o, o, 1,2,3, 4, 5, 6, 7, 

11 

6, 7, 8,9,o, o, 1 , 2, 3, 4, 5, 
11 

5,6, 7, 8,9,o, o, 1 ,2,3, 4, 

11 

o, 1 ， 2, 3, 4, 5, 6, 7, 8, oC o, 

1± 

8 , 8 , 8 , 8 , 8 , 8 , 8 , 8 , 8 , 8 , 8 , 

/V /IN /V /V /V /V /l\ /V /V /IV /IV 


\ 1 / \7 \)/ \—/ \)/ \—/ \)/ \—/ \7 \J. 

00123456789 

IX 

8,9,o, o, 1 , 2, 3, 4, 5, 6, 7, 

11 

7, 8,9,o,0, 1 ,2,3, 4’ 5, 6, 
11 

3, 4,5,6, 7, 8,9,0,0,1 ， 2, 

11 

1 ， 2, 3, 4, 5, 6, 7, 8,9,o, o. 


o’ 1, 2, 3, 4, 5, 6 ， 7, 8, oC 

/1\ /I\ /l\ /I\ /1\ /I\ /l\ /l\ /I\ /l\ 


, o 
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iii. Of the finite symbols of the £th class, (pu — X) occur in the sets {®/o, i = 
1,2,... ,s} and 入 occur in the sets {®^ 0 , j = 1,2,... ,u] for each t = 
1, 2,, q, implying ks = q(pu — A.), (k — \)u = q\. 

iv. The differences from the finite symbols in A are symmetrically repeated, 
each occurring X times. 

Then developing each set (block) in A cyclically (as in Theorem 3.3) yields 
a BIB design with parameters t = pq 1, b = (s u)p, k,r = up, X. 

As an illustration consider the following example. 

Example 3.6 Let p = l,q=2，t = \5，s = 3，u = 2, with 

®io = (Oi, li, 3i, O 2 , 22 , 62 ) 

®20 = (Oi, li, 3i, I 2 , 52 , 62 ) 

®30 = (Oi, 4i, 5i, O 2 , I 2 , 32) 

®’ 10 = (oc ， Oi, O 2 , I 2 , 22 , 42 ) 

® 2 0 = ( 00 , Oi, 3i, 5i, 61 , O 2 ) 

The reader can check easily that conditions (iii) and (iv) are met with 入 = 5. 
Hence the resulting design is a BIB design with parameters t = 15, b = 35, k = 
6 , r = 14, A. = 5. □ 

3.2.3 Formulation in Terms of Galois Field Theory 

In all the examples given so far, the additive group 5 has been written as S = 
{ 0 , 1 , ...，，一 1 } or S = { 0 , 1 , 1 }, representing the residues mod t or 
mod v, respectively. If r or 1 ; is prime or prime power, the elements of S can be 
expressed also in terms of the powers of a primitive root of the Galois field GF(t) 
or GF(v), respectively. For a discussion of Galois fields we refer to Appendix A. 

Since S can be expressed in terms of all powers of a primitive root of GF(t), 
then also A can be represented in terms of certain powers of such a primitive 
root. It is this fact that makes this representation useful. The reader will have 
noticed that the construction procedures presented in Theorems 3.1-3.5 depend 
upon the existence of difference sets, but we have not yet said how these can be 
obtained. One way, certainly, is by trial and error, but that is feasible only for 
small t and not very satisfactory in general. A more direct way is to make use of 
mathematical results that for certain forms of t give general explicit expressions 
for certain types of difference sets in the form of powers of primitive roots. 
For example, if t = 10€ + 1 is a prime or prime power and x is a primitive 
root of GF(10€ + 1)，then the i sets = (x l , x 2l+l , x 4l+l , x 6l+l , x u+l ), i = 
0, 1, — 1 form a difference set A with A. = 2. As an illustration consider 

the following example. 
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Example 3.7 Lett = 11,1 = 1, then^A = ®oo = (x°, x 2 , x 4 , x 6 , x 8 ), which, 
with x = 2 being a primitive root of GF(ll), translates into 

$oo = (1,4, 5, 9, 3) 

and that can then be used in Theorem 3.1 to construct a BIB design with t = 
U,b= ll,k = 5,r = 5,X = 2. □ 

We shall not pursue this any further here, but refer the reader to Raghavarao 
(1971) for an extensive list of such results and further references. 

3.3 OTHER METHODS 

To complete this chapter we mention briefly four other methods that give rise 
to quite a number of BIB designs. Of these methods, one is a simple direct 
construction procedure, two derive new BIB designs from existing BIB designs, 
and the last is based on notions of factorial experiments. 

3.3.1 Irreducible BIB Designs 

This method is applicable mainly for small t and consists of taking of the t 
symbols all possible combinations of k symbols. This leads to a BIB design with 
parameters t, b = ( 灸 ) ， k, r = (n), 人 = ( 灸一 2 ). 

3.3.2 Complement of BIB Designs 

If the collection of blocks D = {"B\, ® 2 , …， represents a BIB design with 
parameters t ， b ， k,r, and X, then, if we let = T — ® “ the collection of blocks 
D\ = {®*, ® 2 ,..., represents a BIB design with parameters t\ = t,b\ = 
b, k\ = t — k,r\ = b — r, X\ = 办 _ + 入 . 

That this is true can be seen as follows: If N denotes the incidence matrix of 
the existing BIB design, and N denotes the incidence matrix of its complement, 


then 




N + N = 


that is, 




N = 3 t 3 f b 

-N 

and hence 




羽 '= (3 t 3' b -NXHNy 

=b3 t 3[ — 3 t 3' b N' - N3 b 0[ + NN' 
= (b- 2r)0,3' t + NN f 

={b — 2 厂 )1% -j- (r — A.)/ -\r 
=(r — 入 ) / + (Z? — 2 厂 + 
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which implies that N is the incidence matrix of a BIB design, and 

r\ = (r — X) (b — 2r X) = b — r 
入 1 = Z? — 2r + 入 

3.3.3 Residual BIB Designs 

If the collection of blocks D = {*Bi, • • •, represents a symmetrical BIB 

design with parameters t，b = t ， k，r = k ， k ，then one can consider the design 
Z> 2 , which is obtained as follows: Select any block from D ， say and delete 
from each of the remaining blocks ® 2 , ® 3 ,..., the A. elements it has in com¬ 
mon with ®i. Call those sets 3*, Then D 2 = ，…， is 

the residual BIB design with parameters t 2 = t —k ， b 2 = b—\ ， k 2 = k — k ， r 2 = 
k, 入 2 = 入 . 

To see that this is true we show that in a symmetrical BIB design any two 
blocks have 入 treatments in common. This follows from 

N f NN r = ^[(r -A)/ + 入 3 々 ;] 

=[(r - X)I + X3 r J' t ]N' (3.2) 

since N f 0 t = = rO t ', and since N f is nonsingular, (3.2) implies 

N f N ^ (r -X)I +X0,0 , I 

which is the desired result. The rest is then obvious. 


3.3.4 Orthogonal Series 

This method is actually the oldest method. These designs are due to Yates (1936b) 
and are referred to as quasi-factorial or lattice designs, since their construction is 
based on concepts of factorial experiments. The designs generated by these meth¬ 
ods have parameters t = K 1 2 3 , 众 =[ ， /?= 欠 （[+ 1), 厂 =[+ 1, 入 =1 with K 
prime or prime power. This series of designs is also called orthogonal series 1 
(OS1). We shall deal with the construction of OS1 designs in Section 18.3. 

An alternative method of constructing these designs is given by Khare and 
Federer (1981). This algorithm can be described as follows: 


1. Write the treatment numbers 1,2,...,? consecutively in a square array of 
K rows and K columns to yield replicate 1 of the resolvable design, with 
rows constituting the blocks. 

2. Transpose the rows and columns of replicate 1 to obtain replicate 2. 

3. Take the main right diagonal of replicate 2 to form the first row of replicate 
3, and write the remaining elements in each column of replicate 2 in a cyclic 
order in the same column for replicate 3. 
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1 8 11 14 

2 5 12 15 

3 6 9 16 

4 7 10 13 

Because of the repeated application of step 3 above, this method has been re¬ 
ferred to as the successive diagonalizing method (Khare and Federer, 1981). □ 

3.4 LISTING OF EXISTING BIB DESIGNS 

Raghavarao (1971) provides a complete list of existing BIB designs with param¬ 
eters t,b < 100, r,k< 15 together with their method of construction. A similar 
table with difference sets is given by Takeuchi (1962). Another extensive list 
of parameters and references concerning construction of BIB designs for r < 41 
and k <t/2 given by Mathon and Rosa (1996). Table 3.1 gives a list of BIB 
designs with t <25 and k < ll, which can be constructed using the methods 
discussed in this chapter. For each design we give the parameters t ， b ， r ， k，"k and 
the method by which the design can be constructed. If Ms a prime power the ele¬ 
ments in the difference sets are given in terms of powers of the primitive element 
x for the primitive polynomials, P(x), given in Table A2 (see Appendix A). 


4. Repeat step 3 for replicate 3 to generate replicate 4. 

5. Continue this process on the just generated replicate until K replicates 


have been obtained. 

As an illustration consider the following example. 
Example 3.8 Let t = 16 = A 2 ,k = 4,b = 20, r : 


Replicate 1 


Repli 


1 

2 

3 

4 

1 

5 

6 

7 

8 

2 

9 

10 

11 

12 

3 

13 

14 

15 

16 

4 


Replicate 3 



1 

6 

11 

16 

1 

2 

7 

12 

13 

2 

3 

8 

9 

14 

3 

4 

5 

10 

15 

4 


Replicate 5 




5 

6 

7 

8 

Repli 


3 4 5 6 

11 li 1A 


4 


9 0 12te 

111 «a 

c 


5 6 3 4 
11 


o 1 
1 ± 1 ± 


2 

IX 
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Table 3.1 BIB Designs with t < 25, k <1\ 


Design 

t 

b 

r 

k 

入 

Method 

1 

3 

3 

2 

2 

1 

Irreducible 

2 

4 

6 

3 

2 

1 

Irreducible 

3 

4 

4 

3 

3 

2 

Irreducible 

4 

5 

10 

4 

2 

1 

Irreducible 

5 

5 

5 

4 

4 

3 

Irreducible 

6 

5 

10 

6 

3 

3 

Irreducible 

7 

6 

15 

5 

2 

1 

Irreducible 

8 

6 

10 

5 

3 

2 

Residual of design 25 

9 

6 

6 

5 

5 

4 

Irreducible 

10 

6 

15 

10 

4 

6 

Irreducible 

11 

7 

21 

6 

2 

1 

Irreducible 

12 

7 

7 

6 

6 

5 

Irreducible 

13 

7 

21 

15 

5 

10 

Complement of design 11 

14 

8 

28 

7 

2 

1 

Irreducible 

15 

8 

14 

7 

4 

3 

Difference set: 

(oo, 1,2,4); (0, 3, 6, 5) mod 7 

16 

8 

8 

7 

7 

6 

Irreducible 

17 

9 

36 

8 

2 

1 

Irreducible 

18 

9 

18 

8 

4 

3 

Difference set: 

r 2 r 4 y 6\. / r r 3 r 5 r 7\. 

、人，人，人 9 / 9 ，人，人，人 )' 

x e GF(3 2 ) 

19 

9 

9 

8 

8 

7 

Irreducible 

20 

9 

18 

10 

5 

5 

Complement of design 18 

21 

10 

45 

9 

2 

1 

Irreducible 

22 

10 

30 

9 

3 

2 

Difference set: (O 2 , I 2 , ^ 2 , )； 
(1i,4i,0 2 )(2i,3 1 ,0 2 )； 

(li, 4 i, 22, )； (2i, 3i, 22 )； 
(0 i,02 ,2 2 ) mod 5 

23 

10 

18 

9 

5 

4 

Residual of design 42 

24 

10 

10 

9 

9 

8 

Irreducible 

25 

11 

11 

5 

5 

2 

Difference set: 

(1,4, 5, 9, 3) mod 11 

26 

11 

11 

6 

6 

3 

Complement of design 25 

27 

11 

55 

10 

2 

1 

Irreducible 

28 

11 

11 

10 

10 

9 

Irreducible 
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Table 3.1 (Continued) 


Design 

t 

b 

r 

k 

入 

Method 

29 

11 

55 

15 

3 

3 

Difference set: (0, 1 ， 10); 

(0, 2, 9) (0,4, 7) 

(0,8,3); (0,5,6) mod 11 

30 

12 

44 

11 

3 

2 

Difference set: 

(0,1,3); (0,1,5); 

(0,4, 6 ); (oo, 0, 3) mod 11 

31 

12 

33 

11 

4 

3 

Difference set: 

(0,1, 3, 7); (0, 2,7, 8 ); 

(oo, 0, 1,3) mod 11 

32 

12 

22 

11 

6 

5 

Difference set: 
(0,1,3,7,8,10); 

(oo, 0, 5, 6 , 8 , 10) mod 11 

33 

13 

26 

6 

3 

1 

Difference set: (1, 3, 9); 

(2, 6 , 5) mod 13 

34 

13 

26 

12 

6 

5 

Difference set: 

(1,4,3, 12,9, 10); 

(2, 8 , 6 , 11,5,7) mod 13 

35 

13 

39 

15 

5 

5 

Difference set: (0, 1, 8 , 12, 5); 
(0, 2,3,11,10); 

(0,4, 6 , 9, 7) mod 13 

36 

15 

35 

7 

3 

1 

Difference set: 

(li, 4i, O 2 , )； (2i, 3i, O 2 )； 

(I 2 , 42 , O 3 )； ( 22 , 32 , O 3 ,); 

(I 3 , 43 , Oi); ( 23 , 33 , Oi); 

(Oi, () 2 , 03 ) mod 5 

37 

15 

35 

14 

6 

5 

Difference set: 

( 00 , Oi, O 2 , I 2 , 22,42); 
( oo ,0 i ,3 i ,5 i ,6 i ,0 2 )； 

(Oi, li, 3i, O 2 , 22 , 62 )； 

(Oi, li, 3i, I 2 , 52 , 62 )； 
(0i,4i,5i, 0 2 , l 2 , 3 2 ) mod 7 

38 

16 

24 

9 

6 

3 

Residual of design 50 

39 

16 

80 

15 

3 

2 

Different set: 

/ v 0 r 5 r 10\. / V 1 r 6 r 11\. 

、入，入，人 ) 9 V 入 ，•^ ，人 ) 9 

(J1 r 7 r 12\. / r 3 r 8 r 13\. 

、人，入，入 / 5 V入 ，人 / ^ 

(x 4 ,x 9 ,x 14 ); x e GF(2 4 ) 

40 

16 

48 

15 

5 

4 

Difference set: 

/ r 0 r 3 r 6 v 9 v 12\. 

、人，入 ^ ，人 / 9 

/ V 1 r 4 r 7 v 10 v 13\. 

、人，入，入 ) ? 

/ r 2 r 5 r 8 r ll r 14\. 

，入，入 / 9 

x € GF(2 a ) 
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Table 3.1 {Continued) 

Design t b 


41 

19 

57 

9 

3 

42 

19 

19 

9 

9 

43 

19 

19 

10 

10 

44 

19 

57 

12 

4 

45 

21 

70 

10 

3 


46 21 42 12 6 


47 22 77 14 4 


48 

23 

23 

11 

11 

49 

25 

50 

8 

4 

50 

25 

25 

9 

9 

51 

25 

100 

12 

3 


入 Method 

1 Difference set: 

(1,7,11); (2, 14, 3); 

(4, 9, 6 ) mod 19 

4 Difference set: 

(1,4,16,7,9, 17,11,6, 5) 
mod 19 

5 Complement of design 42 

2 Difference set: 

(0,1,7,11); (0,2,14,3); 

(0, 4, 9, 6 ) mod 19 

1 Difference set: 

(1i,6i,0 2 ); (2i,5i,0 2 )； 

(3i, 4i, O 2 )； (12, 62 , O 3 ); 
(2 2 ,5 2 ,0 3 ); (3 2 ,4 2 ,0 3 ); 

(I 3 , 63 , 0 i); ( 23 , 53 , 0 i); 

(3 3 ,4 3 ,0i); (0i,0 2 ,0 3 ) mod 7 

3 Difference set: 

(0i, 5i, I 2 ,42, 23 , 33 ); 

(Oi, li, 3i, O 2 , I 2 , 32); 

(O 2 , 52, I 3 ,43, 2i, 3i); 

(O 2 , 12, 32 , O 3 , I 3 , 33 )； 

(O 3 , 53 , li, 4i, 22 , 32 )； 

(O 3 , l 3 , 3 3 , 0 i, li, 3i) mod 7 

2 Difference set: 

(li, 61 ， 32, 42); (3i, 4i, 22, 52 )； 
(2i, 5i, 62 , 12 )； (I 2 , 62 , 33 , 43 ); 
( 32 , 42 , 23 , 53 )； ( 22 , 52 , 63 , I 3 )； 
(I 3 , 63 , 3i, 4i); ( 33 , 43 , 2i, 5i); 
( 23 , 53 , 61 ， li); ( 00 , Oi, O 2 , O 3 ); 
( 00 , Oi, O 2 , O 3 ) mod 7 

5 Difference set: 

(1,2,4,8, 16,9, 18, 13,3,6, 12) 
mod 23 

1 Difference set: 

(0,x°,x s ,x l6 y, 

(0,x 2 ,x lo ,x K y, 

X e GF(5 2 ) 

3 See Fisher and Yates (1957) 

1 Difference set: 

/ r 0 v 8 v 16\. / V 1 v 9 r 17\. 

、入，人 ) 9 ，人，入 ) 9 

/ r 2 Y 10 r 18 v / r 3 r ll r 19 v 

\ ，人 / 9 ，•^ ，人 / 9 

x e GF(5 2 ) 






CHAPTER 4 


Partially Balanced Incomplete 
Block Designs 


4.1 INTRODUCTION 

It can be seen very easily from the list of existing BIB designs [see Raghavarao 
(1971), Mathon and Rosa (1996), and also Table 3.1] that such designs exist for 
certain parameters only and often with an inordinately large number of replicates, 
the main reason being that condition (2.4) has to be satisfied with 入 being an 
integer. For example, using (2.4), with 8 treatments and blocks of 3 units, the 
lower limit for the number of replicates is 21, and with this number of replicates 
the blocks would consist of all combinations 3 at a time of the treatments. In 
fact, Yates himself recognized that BIB designs are “rare” and then developed 
different types of quasi-factorial or lattice designs (see Chapter 18). Although 
some of the lattice designs are special cases of BIB designs (see Section 3.3.4 )， 
others are special cases of a much larger class of designs, namely partially 
balanced incomplete block (PBIB) designs. These were introduced by Bose and 
Nair (1939). ^ 

Recall that BIB designs have the property that all treatment differences are 
estimated with the same accuracy. For PBIB designs this property will not be 
sacrificed completely, but only to the extent that, loosely speaking, pairs of treat¬ 
ments can be arranged in different sets such that the difference between the 
treatment effects of a pair, for all pairs in a set, is estimated with the same 
accuracy. 

4.2 PRELIMINARIES 

Before giving the definition of a PBIB design, we shall take a look at the reduced 
normal equations (1.7) for a general incomplete block design, investigate the 
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PARTIALLY BALANCED INCOMPLETE BLOCK DESIGNS 


structure of these equations in light of the remarks made in the introduction, and 
see what that might imply. 


4.2.1 Association Scheme 

Recall that the RNE for a proper equireplicate incomplete block design are of 
the form [see (1.7)] 


where 


rI--NN'\r = T--NB 


NN = 〉: = (A"’ ）（ say) 


For the BIB design "ku，= k for all / ， 〆（/# /’）and = r for all i. The term 
kii ，denotes the number of times that treatments i and i' appear together in the 
same block. We now wish to consider the case that for a given treatment i, the 
numbers Xu' are equal within groups of the other treatments. Suppose that over 
all i and i' the 入 take on the values 入 i, 入 2 ,… ，入 m. For the sake of argument we 
shall assume that the X u (u = 1, 2,..., m) are different although it will become 
clear later that this is not necessary. The set of treatments i' for which 入 = 入 1 
will be denoted by S(i, 1) and called the first associates of treatment i. The 
set of treatments i' for which 入 = 入 2 will be denoted by S(i, 2) and called 
the second associates of treatment i, and so forth. For any given i then the 
remaining treatments fall into one of the sets S(i, 1), S(i, 2),..., S(i, m). The 
question now is: Under what conditions does this association scheme imply that, 
for example, treatment i and all the treatments i' in S(i, u) are compared with 
the same precision and when is this the same for all pairs of wth associates 
independent of the treatment /? 


4.2.2 Association Matrices 


In order to investigate this question we shall use the concept of an association 
matrix ，which was introduced by Thompson (1958) and investigated in more 
detail by Bose and Mesner (1959). Define the uth association matrix, sl t x t 
matrix, as 


= (b^) (w = 1, 2,..., m) 


with 


Kp = 



if treatment a and ^ are wth associates 
otherwise 
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At this point it is convenient to introduce Bq = I and 入 o = r. It follows then 
from what has been said earlier that 

m 

J2 B > (4.1) 

u=0 

Furthermore, 

m 

NN' = (4.2) 

u=0 

and hence 

1 , k- 1 f 

c = rl - -NN' = r ^ y B u (4.3) 

U=1 

or in words: The coefficient matrix C of the RNE is a linear combination of the 
association matrices. 


4.2.3 Solving the RNE 

To solve the RNE, we know from (1.17) that 

cc~ l ^1 -j00 r (4.4) 

where C = C + a33 r and a ^0, real. Using (4.1) and (4.3) we can rewrite (4.4) 
in terms of association matrices as follows: 



(4.5) 


where 



入 M 


(u = 1, 2, ..., m) 


c u = a 


Now, let 



(4.6) 


Then we know that for any two pairs (/，/’）and (/, i〃) 

var (r；- - = (gu + g iV — 2g iV )<y ； (4.7) 

var(r ； - = (g u + gi 〒一 2g u ")c^ (4.8) 
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Suppose that i' and i" are uth associates of i. Then we would like to have that 
(4.7) and (4.8) are equal. One way that this can happen is if 

gii = gi，V = gi 〒 

and 

gw = gw ， (4.9) 

Furthermore, we would like var(r) — r)/) to be equal to the right-hand side of 
(4.7) for any pair (j, j f ) that are uth associates. It is obvious that this can be 
achieved if G is of the form 

m 

^ guB u 
u=0 

Substituting (4.10) into (4.5) yields 

^~' t _ 1 

/ y c u§v B U B V = 丁 B 0 

M,l>=0 

Since the right-hand side of (4.11) is a linear combination of the B u ’s, the 
left-hand side also must be one. This would imply that B U B V is of the form 

m 

B u B v = J2pL B k (4.12) 

k=0 

that is, B U B V is itself a linear combination of the B u ’s. 

4.2.4 Parameters of the Second Kind 

It will turn out that the coefficients p\ v in (4.12) play an important role in 
the definition of PBIB designs. It is for this reason that they are referred to as 
parameters of the second kind. To understand what they really are and to grasp 
the full meaning of (4.12), let us consider two treatments, a and ^ say. Consider 
then the (a, P) element of B U B V and of p^ v The (a, P) element of 
B U B V is of the form 

E KyKp 

K=1 

Since B u is symmetric (follows from the fact that if a is the £th associate of 
then ^ is also the £th associate of a), this is the same as 


(4.10) 


1 rn 

(4-H) 


E Wr 


( 4 . 13 ) 
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which equals the number of unity elements the ath row of B u and ^th row 
of B v have in common, or expressed differently, which equals the number of 
treatments that S(a, u) and S(P ， v) have in common. The corresponding element 
of the right-hand side of (4.12) is 


T,Puv b U (4.14) 

k=0 

Because of (4.1) there is only one element equal to one, namely if a 
and ^ are €th associates. Hence 

t 

PL = J2 Ky b h (4 . 15 ) 

K=1 

The left-hand side of (4.15) does not depend explicitly on a and p but only 
on the type of association between a and p and hence is the same for any two 
treatments that are ith associates. In particular for a = ^ and u = v wc obtain 

t t 

Puu = (^ay) 2 = ^ay = n u ( sa y) 

y=l y=l 

which is the number of elements in S(a, u). This implies that every treatment 
has n u wth associates. In summary then, if there exists an association scheme 
such that (4.15) holds, then (4.10) and hence (4.9) can be achieved. 


4.3 DEFINITION AND PROPERTIES OF PBIB DESIGNS 

We now give the formal definition of a PBIB design based on the definition given 
by Bose and Shimamoto (1952) and derive some of its properties in terms of its 
various parameters. 

4.3.1 Definition of PBIB Designs 

With the motivation provided in Section 4.2 we now have the following definition. 

Definition 4.1 An incomplete block design is said to be a PBIB design, if it 
satisfies the following conditions: 

1. The experimental material is divided into b blocks of k units each, different 
treatments being applied to the units in the same block. 

2. There are t treatments each of which occurs in r blocks. 
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Hence n\ = 3, ^2 = 3, = 1, and, from inspection of the design, = 

2, 入 2 = 1, 入 3 = 0， and 


3. Any two treatments are either 1st, 2nd, .. mth associates; each treatment 
is the 0th associate of itself and of no other treatment. 

4. Each treatment has n u wth associates (m = 0, 1,..., m). 

5. If any two treatments are kth associates, then the number of treatments 
common to the uth associates of the first and the vth associates of the 
second is p^ v , independent of the pair of kth associates. 

6. Any two treatments that are uth associates appear together in blocks. 

The terms t, b, r, k, A ， 2 , ..., n\, ri 2 , …， n m are called the parameters 

of the first kind, and the p^ v are called parameters of the second kind. 

The parameters of the second kind are generally exhibited in matrix form as 
P/c = (Pu V )(k = 0, 1,..., m), and in our notation these are (m + 1) x (m + 1) 
matrices (this deviates from the original notation by Bose and 
Nair, 1939). □ 

As an illustration of a PBIB design we consider the following PBIB design 
with three associate classes. 

Example 4.1 Let t = S,b = 6,r = 3,k = 4 


Block 

Treatments 

1 

l, 

2, 

3, 

4 

2 

l, 

2, 

5, 

6 

3 

l, 

3, 

5, 

7 

4 

2, 

4, 

6, 

8 

5 

3, 

4, 

7, 

8 

6 

5, 

6, 

7, 

8 


The association scheme is as follows: 

0th Associate 1st Associates 2nd Associates 3rd Associates 


6 


4 


2 1 


78878765 
6，5,5， 6，3，4，4， 3, 
4, 3, 2， 1, 2, 1" 1，2, 


3,4, 4, 3, 6, 5, 5, 6, 
2, 1， 1, 2, 1， 2, 3, 4, 


4 
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尸 0 = 


"1 _ 


"o 

1 

0 

o " 

3 0 

Pi = 

1 

0 

2 

0 

0 3 

0 

2 

0 

1 

1 


0 

0 

1 

0 


Pl = 


"0 

0 

1 

o ' 


"0 

0 

0 

r 

0 

2 

0 

1 

Ps = 

0 

0 

3 

0 

1 

0 

2 

0 

0 

3 

0 

0 

0 

1 

0 

0 


1 

0 

0 

0 
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□ 


4.3.2 Relationships Among Parameters of a PBIB Design 

The following relationships exist among the various parameters of a PBIB design: 


tr = 

bk 

(4.16) 

m 

J2 nu = 

t 

(4.17) 

u=0 

m 

〉 : "M 入 M = 

rk 

(4.18) 

M=0 

Puv = 

Pvu 

(4.19) 

m 

X] Puv = 

n v 

(4.20) 

u=0 

uv = n uPtv 

=n vPuk 

(4.21) 


Relationships (4.16), (4.17), and (4.18) follow immediately from the definition 
of a PBIB design. 

To verify (4.19) we note that the relation of association is symmetric and that 
therefore the association matrices B u are symmetric. Hence 


B U B V = B u B[, = (B v B u y = 



f 


=^2 = ^ Pvu 

k k 


which implies (4.19). 
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To verify (4.20) consider 


and 



B v = 33’B v = n v 33 , = n v 〉: 

k=0 


E B - B ^ = E (e puv) ^ 

m=0 u k k \ u / 


Comparing coefficients yields (4.20). 

Finally, to derive (4.21) we use (4.12) twice and consider 


and 


(B u B v )B^Y,\Lp^Pu) B s 

5=0 \k=0 / 


(B l B v )B u ^Y J \HP k lvPlu)Bs 

5=0 \A：=0 / 


Comparing coefficients yields 

J2 PuvPkl = J2 PivPku ( 4 . 22 ) 

k k 

which for 5 1 = 0 reduces to 

nep e uv = n u p u tv 

since = ni fork = i and zero otherwise as follows easily from the, trivial, 
definition of 0th associates. Concerning the parameters of the second kind, we 
have already mentioned that 



Puv = | 

n u 

for u = v 

(4.23) 


0 

otherwise 


Also, 

POv = 

I 1 

fork = v 

(4.24) 


0 

otherwise 
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which follows immediately from 

Bv = BoB v — ^2 Pov 

k 

Equations (4.23) and (4.24) hold for any PBIB design, whereas the remaining 
p\ v are determined by the particular association scheme for a given PBIB design, 
as will be shown later. For this reason we refer to a PBIB design as defined 
above as an m-associate class PBIB design, which henceforth we shall denote 
by PBIB(m) design. 

4.4 ASSOCIATION SCHEMES AND LINEAR 
ASSOCIATIVE ALGEBRAS 

In the motivation and derivations given in Sections 4.2 and 4.3 we have made 
use of some results given by Bose and Mesner (1959), in particular, relationship 
(4.12), which leads to the introduction of the parameters of the second kind. There 
exists, indeed, a much more mathematical relationship between the association 
matrices, B u , and certain matrices with elements p^ v . This relationship provides 
some further insights into the structures of PBIB designs. We shall give a brief 
account of the major results but refer the reader to Bose and Mesner (1959) for 
details. 

4.4.1 Linear Associative Algebra of Association Matrices 

Consider the association matrices B u {u = 0, 1, 2,..., m) defined in Section 
4.2.2. We have seen [see (4.1)] that 

忍 o + 忍1 + • • • + B m = 00 r 

It follows then that the linear form 

c o-^o + c i^i + ... + c m B m (4.25) 

is equal to zero if and only if 

cq = c\ = • • • = c m = 0 

Hence the linear functions (4.25) of the B u ’s form a vector space with 
Bo, B\,, B m as basis. 

We also know [see (4.12)] that 


B U B V = 〉: p^ v Bk 


(4.26) 
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If A and B are two matrices of the form (4.25), then the product AB is, 
because of (4.26), again of the form (4.25). The set of matrices of the form 
(4.25) is, therefore, closed under multiplication. It also forms an Abelian group 
under addition. Also, because of (4.19), multiplication is commutative. As a 
consequence, the set of matrices of the form (4.25) constitute a ring with unit 
element, ^o- It will be a linear associative algebra if the coefficients c u range 
over a field. 

4.4.2 Linear Associative Algebra of V Matrices 

We have introduced in Section 4.3.1 matrices Pk = {Puv^ whose elements are 
the parameters of the second kind (/: = 0, 1,..., m). Following Bose and Mesner 
(1959) we introduce matrices 



~Pl 

POv 



— 

p\v 

P\v 

… pZ 

(4.27) 


-Pmv 

Pmv 

… P'mv- 



with v = 0, l,..., m. We note that in (4.27), the superscript k is the column 
index, and the first subscript u is the row index. Also note that although the Pk 
are symmetric, the < S ) V are not necessarily symmetric. 

We now consider equation (4.22), that is, 

Yl PuvPkl = PivPku 
k k 

which follows from the associative and commutative laws of multiplication for 
the B u ’s, and that, because of (4.19), can be written as 

J2 PuvPkl = J2 PviPuk ( 4 . 28 ) 

k k 

The left-hand side of (4.28) is the element in the uih row and 5th column of TyTi ， 
Since the element in the uth row and *yth column of 3^ is p s uk , the right-hand 
side of (4.28) is the element in the uth row and 5th column of 

少 1 + ... + /Cm 

It follows then that we have the representation 


y v y e = J2p k vt n 


(4.29) 
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This corresponds to (4.12) and shows that the 7 matrices multiply in the same 
way as the B matrices. 

Since p^ v = 1 for ^ = i; and 0 otherwise [see (4.24)], the Oth row of ( 5 } v 
contains a 1 in column v and 0’s in the other positions. This implies immediately 
that the linear form 


^0^0 + ^1^1 (4.30) 

is zero if and only if co = c\ = . • • = c m = 0. Thus the 7 matrices are linearly 
independent and hence form the basis of a vector space. Since they combine 
in exactly the same way as the B matrices under addition and multiplication, 
they provide a regular representation in terms of (m + 1) x (m + 1) matrices of 
the algebra of the B matrices, which are ? x r matrices with t > m 1. The IP 
matrices behave in the same way as the B matrices. Hence, the set of matrices 
of the form (4.30) also constitute a linear associative algebra with unit element 
^ 0 = ^ m+l. 

4.4.3 Applications of the Algebras 

Bose and Mesner (1959) describe in detail applications of the algebras to com¬ 
binatorial problems and algebraic properties. We have touched on some of these 
in Section 4.3. We shall mention here only one further application, namely the 
determination of the eigenvalues of the C matrix for PBIB(2) designs. 

We have seen in our earlier discussion (see, e.g.，Sections 1.3.3 and 1.10) that 
the eigenvalues of the C matrix of a given design or, alternatively, of its NN’ 
matrix play a major role in describing properties, for example, the efficiency 
factor, of the design. We have seen that NN f has one eigenvalue equal to rk. To 
obtain the other eigenvalues we make use of the following arguments. 

Let B = c u B u and let /(0) be a polynomial. Then f(B) can be 
expressed as 


f(B) = ^ B u 

u=0 

If IP = is a representation of B, then 


/CP) = L 4 n 

u=0 

If /( 入 ） is the minimum function of B and 0( 入 ） the minimum function of 
it can be shown that 


m = 入 ) 


which implies that B and 7 have the same eigenvalues. 



130 


PARTIALLY BALANCED INCOMPLETE BLOCK DESIGNS 


We can apply this result to the specific case where 

B = N = XqBq -|- X \ S i + ... + A, m B m 
and 

少 =^0^0 + 人 13^1 + . . . + 入 

It follows then that NN’ has at most m + 1 distinct eigenvalues. We know 
already that rk is one eigenvalue of NN r with multiplicity 1, and it is the largest 
eigenvalue of NN f . The eigenvalues are determined from \ J > — 61 \ =0. Specif¬ 
ically for m = 2 and after removal of ^ = rk, the remaining two eigenvalues are 
determined from 

(入 0 _ 沒) 2 + [(入 1 _ 入 2)( P ?2 _ p }〗） _ ( 入 1 + 入 2)] (入 0 _ 沒） 

+ ( 入 1 — 入 2)( 入2尸!2 _ 人1尸?2) +入1入2 = 0 

which yields [see Bose and Mesner (1959), Connor and Clatworthy (1954), and 
Street and Street (1987) for an explicit derivation] 

^ = 入0 _ 士 [(入 1 _ 入2)( _ 入 + + (入 1 + 入 2)] (4.31) 

for u = 1,2, and 

y = P\2 ~ P\2 A = + 20 + 1 ^ = P\2 + P\2 

and, of course, 入 o = r. We then have 

I NN f -61\ = {rk- 0)(0i - 0) otl {e 2 - 0) ai 

where a\, a 2 are the multiplicities of 0\ and O 2 , respectively. To determine ot\ 
and q ?2 we use the fact that 

trace I = l a\ a 2 = t 


and 


trace NN f = rk a\0\ + (X 2 O 2 = tr 

Solving these equations and substituting (4.31) yields 

n\-\-ri2 (n\ - m) + y(n\ + ^ 2 ) 

- r (—1 ； -- 

2-s/A 


(4.32) 


oi u = 


2 
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for m = 1 ， 2. These results imply that the matrix 

1 , 

C = rl - —NN' 
k 

has eigenvalues 

e u 

d u =r-^- (4.33) 

k 

with multiplicities = 0, 1, 2), where = rk, ao = 1, and 9\, O 2 and a\, a 2 
are given by (4.31) and (4.32), respectively. 

Similar results can, of course, be obtained for PBIB designs with m > 2 asso¬ 
ciate classes. 

The eigenvalues given in (4.33) together with their multiplicities for w = 1, 2 
can be used to obtain the efficiency factor of a given design (see Section 1.11). 
We shall also make use of them in Section 4.7 in connection with the combined 
analysis of PBIB designs. 


4.5 ANALYSIS OF PBIB DESIGNS 
4.5.1 Intrablock Analysis 

The analysis of PBIB(m) designs can be derived easily from the general analysis 
of incomplete block designs as presented in Section 1.3. In order to solve the 
RNE [see (1.8)] ^ 

Cr= Q 

where C is given in (4.3) and Q is the vector of adjusted treatment totals, we 
shall find a generalized inverse C 一 for C satisfying (1.17); that is, 

CC~ =1 - -33' 

t 

by utilizing (4.11) and (4.12) as outlined previously. This leads to the following 
system of m + 1 equations in m + 1 unknowns go, g\, … ， g m : 

mm 1 

c uPuv^v = ^ ~ ~ f or k = Q 

u=0 i>=0 


for k = 1, 2, ..., m 


(4.34) 
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Since 

mm. m 

c uPuv = ^2/ Curiu = ^ 

M =0 l >=0 M =0 

it follows that this system of equations is not of full rank. Therefore any m of 
the equations in (4.34) can be taken and solved with an additional convenient 
restriction like ^ = 0 or ^ ^ugu = 0, or for some v, g v = 0. 

We shall illustrate this procedure for a PBIB(2) design. The system (4.34) is 
then of the form 


co 卯 + cinigi + C2n 2 g2 = 1 -- 
Cl 及 0 + (Co + Cip\ x + C2p\ 2 )g\ + {c\p\ 2 + C2p\ 2 )S2 = 

C2go+ (ciPn + C2Pn)s\ + (co + c\p\ 2 + c 2 pl 2 )8i = -舍 

with 

r(k- 1 ) 入 2 

co = ^r~ ci = —t C1 二 —t 

Letting go = 0 and omitting the first equation then yields 

(c 0 + Cip l n + C 2 p\ 2 )g\ + (Cip} 2 + C 2 p\ 2 )g2 = -y 
(C\Pn + C 2 Pi 2 )gl + (c 0 + + C 2 P 2 2 )g 2 = -J 


(4.35) 


(4.36) 


Example 4.2 Consider the following PBIB(2) design (SR 36 in Clatworthy, 
1973) with parameters t = S, r = 4, k = 4, b = S, X\ = 0, 入 2 = 2: 


Block 


Treatments 
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Hence co = 3,c\ = 0, C 2 =— 乏 ， and (4.36) becomes 

— 3g2 = 

+ 容 2 = — I 

yielding g\ = —0.3333 and g 2 = —0.2917. 

Having solved (4.34) or its reduced form for 如， gi, … ， g m ，we obtain the 
intrablock estimates of treatment effects as 


gvB v Q (4.37) 


or, equivalently, 

% — §oQi + ^ Qj §2 ^ 2( + … +^ Qi (4.38) 

j€S(i,l) keS{i,2) ieS(i ， m) 

(i = 1,2,... ,t) where S(i, u) is, as defined earlier, the set of wth associates of 
treatment i. It follows further from (4.37) and the definition of the association 
matrices that 

var(f；— 节，） = 2(g 0 - g u )cr]- (4.39) 



which has the following association scheme: 

0th Associate 1st Associates 2nd Associates 


7, 7, 6, 6, 7, 7, 6, 6, 
6, 5, 5, 5, 6, 5, 5, 5， 
4, 4, 4, 3, 4, 4, 4， 3’ 
3, 3, 2, 2, 3, } 2, 2, 

11 11 11 11 11 11 


12 3 4 


114 

o o 1 
o o 1 


^2 


0 0 6 
loo 
o 1 o 


p 


nd 


12 3 4 


ng 

giv 


^o 
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if treatments i and i' are uth associates. There are, therefore, m types of compar¬ 
isons which, however, in certain cases may not all be distinct. Correspondingly, 
there are also at most m types of efficiency factors, E u . Since 

广 〜 2a e 2 

var(T/ — Ti ')= - 

r 

for a CRD or RCBD, we have 

E u = ~~ - (4.40) 

r(go~ gu) 

(w = 1, 2, … ， m) (see Section 1.11). The overall efficiency factor of a PBIB(m) 
design relative to the CRD or the RCBD is then given by 

1 m 

E = —- J2 (4.41) 

t u=\ 

For the PBIB(2) design discussed in Example 4.2, we find (since go = 0) 
var(ri — r/0 = .6666〜 2 

for 1st associates and 

var(r ； — r/0 = .5834〜 2 

for 2nd associates. Also, from (4.40) and (4.41) 

1 

E\ = - = .750 

1 

五 2 = - = .875 

— rg 2 

E = + 6 E 2 ) = .842 D 


4.5.2 Combined Analysis 

As developed in Section 1.8.3, the set of normal equations for the combined 
analysis is of the form 


with 


At = P 


(4.42) 


A = rl - 1 — P — NN , 
k 


(4.43) 
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and 

1 - p 一 1 

P = T - —NB (4.44) 

k 

In order to obtain T we use the same type of argument as in Section 4.2.1 ， 
that is, make use of the special form of A as expressed in terms of association 
matrices; that is, 


with 


A = 〉: cin B 

M=0 


(4.45) 


ao = 




= _ 



(u = 1, 2, ... ， m) 


(4.46) 

(4.47) 


Writing A -1 as 


A~ l = J2 a ll B u 

M=0 


we obtain, equivalent to the set of equations (4.34) for the intrablock analysis, 
the following system of equations: 


AA~ 



or equivalently, 



^ Puv B k = 1 

uv k=0 


(4.48) 


^ ^ a u p^ v a v = 8ok (k = 0, l,... ,m) (4.49) 

M=0 v=0 


where 8 ok is the Kronecker symbol. This set of equations is of full rank and 
hence we have (m + 1) equations in (m + 1) unknowns. 

It follows then that 

Pi -\- E p j+ a2 E + … . + a m 匕 (4.50) 

j€S(i,l) keS(i,2) l€S(i,m) 


where the P M (w = 1, 2,..., 0 are the elements of P in (4.44). 
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For any two treatments that are uth associates we have consequently 

var — i^/) = 2(a° — a u )o^ (4.51) 

We shall illustrate the procedure by using Example 4.2 given in Section 4.4.1. 

Example 4.2 (Continued) We have t = S, r = Xq = 4, k = 4, b = S, X\ = 

0 , 入 2 = 2 . 

"1 1 [0 1 0" 

P 0 = 1 P\= 1 0 0 

6 」 L 0 0 6 - 

_ o o r 

P 2 = 0 0 1 

_1 1 4_ 

Then, from (4.46) and (4.47), 

ao = 4 (1 _ - ~~^ — ) = 3 + p _1 

ai = 0 

= _(1 _ P ~ X )\ 

and hence Eqs. (4.49) are given as 

(3 + p])a 0 -3(1-p~ x )a 2 = 1 

(3 + p- 1 )^ 1 -3(l-p~ 1 )a 2 = 0 

—士 (1 — p—i)a。— 士 (1 — p—i)ai +[3 + p_i 2(1 = 0 

There remains, of course, the problem of estimating p or p~ l . Substitu¬ 
ting such an estimator in the equations above, or more generally equations 
(4.49), will then yield solutions for a 0 , a 1 , a 2 which we denote by a 0 , a' 1 ,^ 2 
and which, in turn, are used in Eqs. (4.50) and (4.51). One such estimator is 
the Yates estimator as described in Section 1.10.1. For certain types of PBIB(2) 
designs we shall present in Section 4.7 other estimators for p similar to the one 
given for BIB designs (see Section 2.5), which will lead to uniformly better 
estimators (than the intrablock estimators) for treatment contrasts under certain 
conditions. □ 
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4.6 CLASSIFICATION OF PBIB DESIGNS 

Since their introduction by Bose and Nair (1939) and Bose and Shimamoto (1952) 
different types of PBIB designs have evolved. These can be classified according 
to the number of associate classes and type of association scheme. (We mention 
here parenthetically that the association scheme and the parameters of a PBIB 
design do not lead immediately to the actual experimental layout or field plan; 
see also Chapter 5). In this section we shall give a brief survey (without being 
complete) of types of PBIB designs. 

The designs perhaps most often used are PBIB(2) designs. The reason, of 
course, is that they constitute the simplest extension of the BIB design. They have 
been tabulated (for certain ranges of parameters) extensively by Bose, Clatworthy, 
and Shrikhande (1954) and Clatworthy (1956, 1973) and are classified according 
to which of the following association schemes (to be explained in the following 
sections) is being used: 


1. Group-divisible PBIB(2) designs 

2. Triangular PBIB(2) designs 

3. Latin square type PBIB(2) designs 

4. Cyclic PBIB(2) designs 

We then mention the association schemes and parameters for some PBIB(3) 
designs: 

1. Rectangular PBIB(3) designs 

2. Generalized group-divisible PBIB(3) designs 

3. Generalized triangular PBIB(3) designs 

4. Cubic PBIB(3) designs 

Finally, we discuss some rather general classes of PBIB designs with m > 3 
associate classes: 

1. Extended group-divisible PBIB designs 

2. Hypercubic PBIB designs 

3. Right-angular PBIB(4) designs 

4. Cyclic PBIB designs 

4.6.1 Group-Divisible (GD) PBIB(2) Designs 

Suppose that t can be written as ? = t\t 2 and that the treatments are divided into 
t\ groups of t 2 treatments each by arranging them in a rectangular array of t\ 
rows and h columns (^i, t 2 > 1), the rows constituting the t\ groups. Then two 
treatments are called first associates if they are in the same group and they are 
called second associates otherwise. 
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and hence n\ = 1, ri 2 = 6. 

Bose and Connor (1952) have shown that for GD-PBIB(2) designs the fol¬ 
lowing inequalities hold: 

r $ 入 i rk — X 2 t > 0 

Accordingly, they have divided the GD designs further into three subclasses: 

1. Singular if r = 入 1 

2. Semiregular if r > X\,rk — 入 y = 0 

3. Regular if r > X\,rk — X 2 t > 0 


This association scheme implies n\ = t 2 ~ I and = (t\ — 1)^2- Further 



"0 1 

0 一 


"0 

0 

1 一 

Pl = 

1 t2-2 

0 

尸 2 = 

0 

0 

,2 — 1 


0 0 

Ol — 1)Q 


_1 t 2 - 1 

(t\ — 2)t2 


Example 4.3 Let t = S,t\ =2,t2 = 4. 

Association scheme: 

13 5 7 
2 4 6 8 

Oth Associate 1st Associates 2nd Associates 


□ 


4- 


Is 

a 


arlar 
Exgul 


6, 5, 6, 5, 6, 5, 6, 5, 


Fo 


4, 3, 4，3，4，3，4, _ 


re( 

' lu'e 
， tliqth 

21212121 

sa' 
y c 
a 

wch 
ah 
Dtw 

78787856 M n 
5, 6, 5, 6, 3, 4, 3, 4 ， se., 2 ， 

r I- 

3,4, 1 ， 2, 1 ， 2, 1,2,o 

ch 

of4’ 

5 -I 

s 


5 6 7 8 
12 3 4 


4 


6 7 


of? 
Onlha 
atlel 
izw 
tol as 
fac ust 
e j 
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4.6.2 Triangular PBIB(2) Designs 

Suppose that t can be written as t = q(q — 1)/2 and that the treatments are 
arranged in triangular form above the main diagonal of a ^ x q array and repeated 
symmetrically below the main diagonal leaving the main diagonal blank. Then 
two treatments are called first associates if they lie in the same row (or column) 
of the q x q array and are second associates otherwise. 

This association scheme implies 

n\ =2q — 4 H 2 = (q — 2)(q — 3)/2 

"0 1 0 
尸 1 = 1 q — 2 q — 3 

_0 q - 3 (q -3)(^ - 4)/2_ 

"0 0 1 

P 2 = 1 4 2 q-S 

1 2^ — 8 — 4) — 5)/2 

Example 4.4 Let t = 10, q = 5. 

Association scheme: 



0th Associate 


1st Associates 


2nd Associates 
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4.6.3 Latin Square Type PBIB(2) Designs 

Suppose that t can be written t = q 2 and that the treatments are arranged in 
a square array of size q. 

For a Latin square type PBIB(2) design with two constraints [L2-PBIB(2) 
design], two treatments are called first associates if they occur together in the 
same row or the same column of the square array and are second associates 
otherwise. 

For the Latin square type design with i constraints (/ > 2) [L/-PBIB(2) design] 
the association scheme is used in conjunction with i — 2 mutually orthogonal 
Latin squares (if they exist) (see 1.10.6.2). Then two treatments are called first 
associates if they both occur in the same row or column or correspond to the same 
letter of the superimposed Latin squares. Hence we obtain for the L/-PBIB(2) 
design 

n\ =i{q - 1) ri 2 = (q - l)(q - / + 1) 

"0 1 0 
•P1 = 1 — 3 / + g (i — 1)(^ — / + 1) 

_0 (i — 1)(^ — / + 1) {q — i)(q — / + 1)_ 

"0 0 1 
P 2 = o i(i - 1) i(q - 1) 

1 i(q — i) (q — i) 2 + / — 2 

Example 4.5 Let t = 9, q = 3. 

/^-association scheme: 

1 2 3 
4 5 6 
7 8 9 

0th Associate 1st Associates 2nd Associates 


998998665 
8, 7, 7, 8, 7, 7,5 ,4, 4, 
6, 6, 5, 3, 3, 2, 3, 3, 2, 
irT 4，4， 2, 1, 1， 2, 1, 1" 

789789998 
4, 5, 6, 6, 6, 5, 8, 7, 7, 


3，2, 5，4，4，4, 5, 



123456789 


□ 
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Other examples are mentioned m Section 18.7. 


4.6.4 Cyclic PBIB(2) Designs 

Suppose the treatments are denoted by 0, 1, 2,— 1. Then the first associates 
of treatment i i d\, i d 2 ,..., i d ni (mod t) where the dj are integers 
satisfying the following conditions: 

1. The dj are all different with 0 < dj < t for each j. 

2. Among the n\(n\ — 1) differences dj — dy (mod t) each of the integers 
d\, d 2 , … ， d ni occurs a times, and each of the integers e\,e 2 ,, e n2 
occurs ^ times, where d\, di, … ， d ni , e\, e 2 ,... ， e n2 are all the different 
integers 1 , 2 ,..., r — 1 . 


The second associates of treatment i obviously are i e\,i e 2 ,.. 
(mod t). We then have, as a consequence of condition 2, 


^ i + e H2 


n\ot + n 2 ^ = n\{n\ — 1 ) 


and 


P\ 


0 


1 


0 


1 a n\ — a — l 

0 n\ — a — l n 2 — n\ a l 


Example 4.6 Let t = 9, q 

= 3. 



L 3 -association scheme: 

1A 

2B 

3C 


4C 

5A 

6 B 


IB 

SC 

9A 


0th Associate 1st Associates 2nd Associates 


□ 


897978564 
6, 4, 5, 2, 3, 1, 3, 1，2, 


989899998 
7, 7, 8, 7, 8, 7, 8, 7, 7, 
5, 6’ 6’ 6, 6’ 5, 6’ 5, 6, 
4，5, 4,5 ,4，4，4, 4，5, 
3, 3, 2, 3, 2, 3, 2, 3, 3, 


2 


2 


2 1 


4 
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4.6.5 Rectangular PBIB(3) Designs 

This association scheme was first proposed by Vartak (1959). Suppose that t = 
t\t 2 and that the treatments are arranged in a rectangular array with t\ rows 
and t 2 columns. Two treatments are said to be first associates if they occur 
together in the same row of the array; they are said to be second associates 
if they occur together in the same column; they are third associates otherwise. 
Hence 

= t2 — 1, n2 = t\ — l ^3 = (^l — l)te — 1) 

and 

"0 1 0 0 

1 t 2 -2 0 0 

1 = 0 0 0 t x - 1 

_0 0 h - 1 (h- 1 ) 02 - 2 ) 

"0 0 1 0 

p _ o 0 0 t2 — I 

1 0 ti-2 0 

_0 t 2 -I 0 (h - 2)(t 2 - 1 ) 

"0 0 0 1 

0 0 1 t 2 -2 

= 0 1 0 h-2 

t2~ 2 t\ —2 (t\ — 2)(t2 — 2) 


■0 0 1 
Pl= 0 ^ n\- P 

_0 n\ — ^ U 2 — n\ ^ 

Example 4.7 Let r = 5, 4 = 2, 办 = 3, a = 0, 0 = 1. With n\ = 2, the pos¬ 
sible differences dj — dy are — J 2 = — 1 = 4 and d 2 ~ d\ = 1; that is, d\ and 
d 2 occur a = 0 times among these differences and e\ = 1, ^2 = 4 occur 0 = 1 
time. Hence condition 2 is satisfied. We then have 

0th Associate 1st Associates 2nd Associates 


□ 


4 2 3 4 3 
10 12 0 


3 4 0 12 
2, 3, 4， 0 ， 1 ， 


0 12 3 4 
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4.6.6 Generalized Group-Divisible (GGD) PBIB(3) Designs 

In a more general way this association scheme was proposed by Roy (1953-1954) 
and described in more detail by Raghavarao (1960). 

Suppose that t = t\t:t 3 and that the treatments are arranged in a three- 
dimensional array with t\ rows, h columns, and layers. Two treatments are 
said to be first associates if they occur together in the same row and column, 
but in different layers; they are said to be second associates if they occur in the 
same row but in different columns (and the same or different layers); they are 
third associates otherwise. Hence 

n\ = t 3 — l H 2 = (t 2 — 1)^3 ^3 = (^1 — 1 )^ 2^3 

and 

-0 10 0 

1 t 3 -2 0 0 

P\ = 

0 0 (t 2 - l)t3 0 

_0 0 0 (^1 — 1)^2^3 

-0 0 1 0 

0 0 t 3 -l 0 

1 t3 — 1 (t2 — 2)^3 0 

_0 0 0 (^l — 1)^2^3 

-0 0 0 1 

0 0 0 t 3 - 1 

尸 3 = 

0 0 0 (t 2 - l)t3 

_1 ^3 — 1 (?2 — 1)^3 (^1 — 2 ) 以 3 



□ 

6 5 6 5 4 3 
4, 3， 2, 1, 2, 1, 


2 14 3 6 5 


5 6 5 6 3 4 
3, 4, 1, 2， 1’ 2, 


12 3 4 5 6 



2, 4, 6, 8, 10, 12 

1,3, 5,7, 9, 11 
2, 4, 6, 8, 10, 12 

1, 3,5,7, 9,11 

2, 4, 6, 8, 10, 12 
1,3,5, 7 ， 9, 11 
2, 4, 6, 8, 10, 12 
1,3,5, 7 ， 9, 11 
2, 4, 6, 8, 10, 12 

1,3, 5, 7, 9, 11 
2, 4, 6, 8, 10, 12 

1,3, 5,7, 9, 11 □ 


4.6.7 Generalized Triangular PBIB(3) Designs 

John (1966) has shown that the triangular association scheme discussed in Section 
4.5.2 can be described equivalently by representing the treatments by ordered 
pairs (x , y) with I < x < y < q and calling two treatments first associates if 
they have one integer in common, second associates otherwise. 

This can then be generalized (John, 1966) for the case that t = q(q — l)(q — 
2)/6 (q > 3) and that the treatments are represented by ordered triplets (x, y, z) 
with l<x<y<z<q. Two treatments are said to be first associates if they 
have two integers in common, second associates if they have one integer in 
common but not two, and third associates otherwise. Hence 

n\ = 3(q - 3) U 2 = 3(q - 3)(q - 4)/2 m = (q - 3)( 4)(q - 5)/6 

and 

■0 10 0 
1 q — 2 2(^ — 4) 0 

f — 1 _ 0 2(q -4) (q - 4) 2 (q - 4)(q - 5)/2 

_0 0 (q-4)(q-5)/2 (q - A){q - 5){q - 6)/6 
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Example 4.9 Let t = 12, t\ = 2, t 2 = 3, = 2. 

Association scheme: 

Layer 1 Layer 2 

1 3 5 7 9 11 

2 4 6 8 10 12 

0th Associate 1st Associates 2nd Associates 3rd Associates 
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Pi 


「0 0 1 
1 4 2(q - 4) 

1 2(q-4) (q + 2)(q - 5)/2 
L0 q — 5 (q — 5) (q — 6) 


0 _ 
q _ 5 

— 5)(^ — 6) 

(q — 5 )( 《一 6)(q — 7)/6. 


尸 3 


「0 

0 

0 


0 

0 

9 


0 

9 

9(q - 6) 


1 

Xq - 6) 

Xq~ 6)(q- 7)/2 


■1 3(q — 6) 3(q — 6)(q — 1)/2 (q — 6)(q — 7)(q — 8)/6. 


Example 4.10 Let t = 20, q = 6. The treatments are represented as 


2= (1,2, 4) 

3 三 

三 (1,2, 5) 

4 三 

三 (1,2,6) 

5^ (1,3,4) 

6 三 

三 (1,3,5) 

7 三 

三 (1,3,6) 


8 三 

= (1,4, 5) 

9 三 

三 (1,4, 6) 




10 三 

三 (1,5,6) 

11 = (2, 3, 4) 

12 E 

三（ 2, 3, 5) 

13 三 

三（ 2, 3, 6) 


14 E 

三（ 2, 4, 5) 

15 三 

三（ 2, 4, 6) 




16 三 

三（ 2, 5, 6) 


17 三 

三（ 3, 4, 5) 

18 三 

三（ 3, 4, 6) 




19 三 

三（ 3, 5, 6) 




20 三 

^ (4, 5, 6) 


and the association scheme follows: 


0th Associate 


1st Associates 


2nd Associates 


3rd Associates 


09876543210 

11 lx 11 11 lx 11 11 lx 11 11 


987654321 


90000009980009986653 

12222221112221111111 

8’ 8, oC oC oC 8’ 7’ 8’ 7’ 7’ oC 8’ 7’ 8’ 7’ 7’ 5’ 4’ 3, 2, 

T-H T-H t—H t—H T-H T-H t—H T-H t—H T-H T-H t—H T-H T-H T-H 

7’ 7 ，入 od irT 6 6 4’ 1 2’ 1’ 1 2’ 1, 1, 

T—H T—H T—H 

6,6,5,4’ 4’ 4’5,5,4’ 4,9,o, o, o, 0,9,o, 0,9, 7, 

T—H T—H T—H T—H T—H T—H T—H T—H 

5’ 3, 3, 2, 3, 3, 2, 2, 3, 3, 8’ 8’ oC oC 8’ 8’ oC 8’ 8’ 6’ 

IX IX IX IX IX 1M IX 11 lx IX 

4’ 2 ， 1’ 1 ， 2 ， 1’ 1, 1, 1, 2, 7’ 7’ veT 匕 7’ 7’ 7’ 匕 I/'T 

11 

o’ o’ oC 8’ o’ oC 8’ 7’ 6’ 5’ 6’ 5’ 5’5,5’ 6’ 3’ 4, 4’ 4, 
9’7’7’6’ 4’ 4’ 3’ 4’ 3’ 2’ 4’ 4’ 3’ 4’ 3’ 2, 2, 2’ 3’ 3’ 


35668990008990000009 
11 11 11 11 11 11 11 11 11 11 11 

2,4’ 4’ 5’ 7’ 7’ 8’ 7’ 8’ aC 7’ 7’ 8’ 7’ 8’ aC 8’ 8, 

1 ± 1 A 1 A 1 A 1 A IX IX lx 1 A il 11 iA IX 

1’ 1 ， 2, 3’ 1’ 2, 3’ 4’ 5’ 6’ 5’ 6’ 6’ 6’ 6’ 5’ 8’ 7’ 7’ 7’ 

11 11 11 11 lx 11 IX 11 11 11 IX 11 11 11 11 11 11 11 11 11 

7,9,0,0,9,0,0,0,0,9 ,4, 4,5,5’ 4’ 4, 4,5,6,6’ 

lx lx lx lx 11 lx 11 1^ 11 11 1^ 11 11 lx 11 11 

6’ 8’ 8’ oC 8’ 8’ oC oC 8’ 8’ 3’ 3, 2, 2, 3, 3’ 2’ 3, 3’ 5’ 

lx 11 11 lx 11 lx lx 11 lx lx 

5’ 5’ 6, 7’ 7’ 7, 6, 6, 7’ 入 2, 1’ 1 ， 1 ， 1 ， 2, 1 ， 1 ， 2, 4, 

11 lx lx 11 lx 11 11 IX 11 11 

4’ 4’ 4’ 3, 6’5’5’ 5, 5, 6’ 5, 6’ 7’ 00 ， 9,o, 00 ， 9,o,o, 
3, 3, 2, 2, 2, 3’ 4’ 3’ 4’ 4’ 2, 3’ 4’ 3’ 4’ 4’ 6, 7’ 7’ 9, 


, 2, 2’c 


2’ C 


-01234567890 

11 11 11 11 11 11 11 11 11 IX 


□ 
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4.6.8 Cubic PBIB(3) Designs 

This association scheme, described by Raghavarao and Chandrasekhararao 
(1964)，is an extension of the L 2 -association scheme discussed in Section 4.5.3, 
which can be described equivalently as follows: Denote the t = q 2 treatments 
by pairs (x, y) with I < x, y < q. Define the distance 8 between two treatments 
(x, y) and y f ) as the number of nonzero elements in (x — x f ,y — y f ). For 
the L 2 -association scheme then two treatments are said to be first associates if 
8=1, and second associates if 8 = 2. 

For the cubic association scheme we have t = q\ each treatment being rep¬ 
resented by a triplet (jc, z) with 1 < x, y, z < q. If the distance 8 between any 
two treatments (x, y, z) and z r ) is the number of nonzero elements in 

(x — x f ,y — y f , z — z f ), then two treatments are said to be first, second, or third 
associates if 5 = 1, 2, or 3, respectively. 

This can be interpreted geometrically by realizing that the above representa¬ 
tion of the treatments corresponds to an arrangement in a cube of side q. Two 
treatments are then first associates if they are lying on the same axis, second 
associates if they are lying on the same plane (but not on the same axis), and 
third associates otherwise. The design given in Example 4.1 has this association 
scheme with q = 2. 

The cubic association scheme implies immediately 


n\ = 3(q - 1) 


n 2 = 3(q - l) 2 113 = (q - l) 3 


and 


0 1 

0 

0 

1 q-2 

2 (q - 1) 

0 

0 2 (q - 1) 

2( 1)(^ — 2) 

(q - l) 2 

0 0 

(q - l) 2 

(Q ~ 1 ) 2 (分 _ 2) 


0 


0 


Pi = 



2 2 (^ — 2 ) q — 1 

2{q - 2) 2(q - l) + (q - 2) 2 2(q - l)(q - 2) 

g—1 2(q — 1)(^ — 2) (q — 1)(^ — 2) 2 


0 0 0 


0 0 3 

—. 0 3 6(q - 2) 

1 3(q - 2) 3(q - 2) 2 


Hq - 2) 
Hq - 2) 2 
(q - 2) 3 
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Other examples are mentioned in Section 18.7. 

4.6.9 Extended Group-Divisible (EGD) PBIB Designs 

These designs, which we shall abbreviate as EGD/(2 V — 1)-PBIB, were first intro¬ 
duced by Hinkelmann and Kempthome (1963) as an extension of the rectangular 
PBIB(3) design of Vartak (1959), and described in more detail by Hinkelmann 
(1964). The association scheme for these designs is as follows: 

Let there be t = t\t 2 ,... ， t v treatments denoted by (/i, «’ 2 ,... ， iv), where 
il = 1,2,..., ti and i = 1 ， 2,… ， v，and it is called the Ith component of the 
treatment. Two treatments are said to be yth associates, y = (yi, K2, • • •, Yv) 
and = 0 or 1(€ = 1, 2,..., y), if the treatments differ only in the compo¬ 
nents that correspond to the unity components of y. We then have m = 2 V — l 
associate classes as each component in y takes on the possible values 0, 1, and 
y = (0, 0, … ， 0) represents the trivial association corresponding to the 0th asso¬ 
ciate class of other PBIB designs. The number of associates in the yth associate 
class is denoted by n(y) and 

V 

«(y) = »(n, 72 , •••, Yv )= n^-i) W (4.52) 

e=i 

If we write the associate classes in lexicographic order, for example, for v = 3: 
000, 001, 010, Oil, 100, 101, 110, 111, and if we denote by 


Example 4.11 Let f = 8, o = 2. The treatments are represented as 


1 = ( 1 , 1 , 1 ) 
3 = (1,2,1) 
5 = (2,1, 1) 
7 ^(2, 2,1) 


Association scheme: 


0th Associate 1st Associates 


2 =( 1 , 1 , 2 ) 
4 = ( 1 , 2 , 2 ) 
6 =( 2 , 1 , 2 ) 
8 = (2, 2, 2) 


2nd Associates 3rd Associates 


□ 


6, 5, 5’ 6’ 3, 4, 4，3, 
4， 3, 2, 1，2，1， 1, 2, 


3, 4, 4, 3, 6, 5, 5, 6, 
2, 1, 1, 2, 1, 2, 3， 4, 


4 
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P 


(k) _ 

o _ 



0 

k — I 


Pi l) = 




the P matrices for a BIB design with 以 treatments, then we can write the P matri¬ 
ces for the EGD/(2 V — 1)-PBIB design recursively as the following Kronecker 
product: 


P 


yiY2---Yv-iO 


P 


yiY 2 --Yv-i 


P 


rm-rv-ii = ^nn-vv-i 


x P 


X P 


(t v ) 


(tv) 


(4.53) 

(4.54) 


where Pywi-Yv-i are the P matrices for an EGD/(2 V_1 — 1) design with t = 
t]_t 2 ,... ， t v -\ treatments. In particular then for v = 2: 


尸 oo = 



0l — 1)02—1) 


尸 01 = 


1 一 

0 

t2 — 2 

0 ti - 1 

h — 1 Oi — 1)(^2 — 2) _ 


^10 = 


1 

0 

0 

t2 _ 1 

0 t\ — 2 

0 

t2 — 1 0 

(h- 2)(t 2 - 1) 


尸 11 = 


0 1 

1 t 2 -2 

1 0 h-2 

t2 — 2 t\ — 2 (t\ — 2)(h — 2) 


which are, apart from the order of rows and columns, the same as those given 
for the rectangular PBIB(3) design (see Section 4.5.5). 
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Example 4.12 Let t = 36, t\ =t 2 — 3, ?3 = 4. The treatments are 


1 = ( 1 , 1 , 1 ) 
5 = (1,2,1) 
9 ^(1,3,1) 
13 = (2, 1, 1) 
17 = (2, 2, 1) 
21 = (2,3,1) 
25 ^ (3, 1, 1) 
29 = ( 3 , 2 , 1 ) 
33 = (3,3, 1) 


2 =( 1 , 1 , 2 ) 
6 = ( 1 , 2 , 2 ) 
10^ (1,3,2) 
14=(2, 1,2) 
18 = ( 2 , 2 , 2 ) 
22 ^ (2, 3, 2) 
26= (3,1,2) 
30 = (3, 2, 2) 
34 ^ (3, 3, 2) 


3 = (1’1,3) 
7^ (1,2,3) 
11 ^ (1,3,3) 
15 = (2, 1,3) 
19 = (2,2,3) 
23 = (2, 3, 3) 
27^(3,1,3) 
31 = (3,2,3) 
35 = (3,3,3) 


4=(1 ， 1,4) 
8 = (1,2, 4) 
12^ (1,3,4) 
16=(2, 1,4) 
20 = (2, 2, 4) 
24 = (2, 3, 4) 
28 = (3 ， 1 ， 4) 
32 = (3, 2, 4) 
36 = (3,3,4) 


The associate classes and the numbers of associates are 


0 = (0, 0, 0) n(0, 0, 0) = 1 

1 = (0, 0, 1) n(0, 0, 1) = 3 

2 = ( 0 , 1 , 0 ) «( 0 , 1 , 0 ) = 2 

3 = (0, 1, 1) «(0, 1, 1) = 6 

4= (1,0, 0) n(l,0,0) = 2 

5 = (1 ， 0,1) n(l,0, 1) = 6 

6 = (1, 1,0) n(\, 1,0) = 4 

7 =(1,1,1) n(l,l,l)= 12 


and the (partial) association scheme is given in Table 4.1 


□ 


4.6.10 Hypercubic PBIB Designs 

These designs, first indicated by Shah (1958) and more formally defined by 
Kusumoto (1965), are extensions of cubic PBIB designs (see Section 4.5.8). Sup¬ 
pose we have t = q v treatments, which are again denoted by (i\, « 2 ,... ， i v ) where 
il = \, 2,q and i = 1 ， 2, ... ， v. The association scheme for this design is 
then as follows: Two treatments are said to be Jth associates if they differ in 
exactly j components. Hence we have m — v associate classes, and the number 
of _/'th associates is 


= C.) (q - 1) (J. = 1，2,…， v) 

Following Shah (1958)，the general element of the P 人 - matrix (k = 0, 1,..., v) 
is given as 



’ u k i _ v 、 


V - 1 - u 


x (q- l) v ~ k ~ u (q - 2) k+i+ j +lu ~ 2v 


(4.55) 



Associate Classes 

—H 

—H 

—H 

18,19,20,22,23,24 

30,31,32,34,35,36 

17,19,20,21,23,24 

29,31,32,33,35,36 

17,18,20,21,22,24 

29,31,32,33,34,36 

17,18,19,21,22,23 

29,30,31,33,34,35 

14,15,16,22,23,24 

26,27,28,34,35,36 

00: 

S' 

30,31,32,34,35,36 

u-T 

cn 

ci 

*— 

13,14,15,17,18,19 

o 

—H 

17,21,29,33 

18,22,30,34 

19,23,31,35 

20,24,32,36 

13,21,25,33 

5,9,29,33 


4,8,16,20 


—H 

14,15,16,26,27,28 

13,15,16,25,27,28 

00 

CN 

(N 

U-T 

(N 

^6 

cn 

13,14,15, 25,26,27 

18,19,20,30,31,32 

2,3,4,26,27,28 


9,10,11,21,22,23 


o 

•n 

CN 

ro 

T—« 

CN 


00 

(N 

ON 



3 


—H 

—H 

26,27,28,10,11,12 

CN 

y—i 

—H 

oC 

00: 

<N 

oC 

cxT 

— 

— 

o' 

oC 

货 

u-T 

—H 

o' 

寸： 

rn 

CN 

18,19,20,22,23,24 


25,26,27,29,30,31 


o 


o 

i> 

CN 

oo 

Ol 

—H 



04 

CO 

06 

(N 


—H 

o 

寸 r 

CO 

CN 

寸的 

rn 

寸。 

(N 

CO 

CN 

— 

c« 

l> 

14,15,16 


33,34,35 


o 

o 

T—■ 

CN 


寸 

in •. 

• 2 


.汔 



42 uas;SJH9e = 、 51 爹 CQIad-p)aow SJ 0 )S91PS UO--SBPOSSV I.st:Ba rrfasBH 
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where /, y = 0, 1,..., v and the summation extends over all integer values of u 
such that 

u < min(v — i, v — j，v — k) 


and 


w > 去 (2v — i — j — k) 


and if for a given combination i, j,k no such u value exists, then = 0. Specif¬ 
ically, from (4.55) we obtain for y = 3 the P matrices for the cubic association 
scheme of Section 4.5.8. 


Example 4.13 Let t = \6, q = 2, v = A. The treatments are 


1 ^( 1 , 1 , 1 , 1 ) 
5 ^(1,2, 1,1) 
9 ^( 2 , 1 , 1 , 1 ) 
13 ^ (2, 2,1,1) 


2 二 （1,1,1，2) 
6 ^( 1 , 2 , 1 , 2 ) 
10 ^( 2 , 1 , 1 , 2 ) 
14 e (2, 2,1,2) 


3 二 （1，1，2, 1) 
7 ^(1,2, 2, 1) 
11^(2, 1,2, 1) 
15 ^ (2, 2, 2, 1) 


4 ^(1,1, 2, 2) 
8 ^ ( 1 , 2 , 2 , 2 ) 
12 ^( 2 , 1 , 2 , 2 ) 
16 ^ (2, 2, 2, 2) 


and the association scheme follows: 


0th Associate 1st Associates 2nd Associates 3rd Associates 4th Associates 


1 

2, 

3, 

5, 

9 

4, 6, 7, 

10, 

11, 

13 

8, 

12, 

14, 

15 

16 

2 

1, 

4, 

6, 

10 

3, 5, 8, 

9, 

12, 

14 

7, 

11, 

13, 

16 

15 

3 

1, 

4, 

7, 

11 

2, 5, 8, 

9, 

12, 

15 

6, 

10, 

13, 

16 

14 

4 

2, 

3, 

8 , 

12 

1 ， 6, 7, 

10, 

11, 

16 

5, 

9, 

14, 

15 

13 

5 

1, 

6, 

7, 

13 

2, 3, 8, 

9, 

14, 

15 

4, 

10, 

11, 

16 

12 

6 

2, 

5, 

8, 

14 

1 ， 4, 7, 

10, 

13, 

16 

3, 

9, 

12, 

15 

11 

7 

3, 

5, 

8, 

15 

1, 4, 6, 

11, 

13, 

16 

2, 

9, 

12, 

14 

10 

8 

4, 

6, 

7, 

16 

2, 3, 5, 

12, 

14, 

15 

1 ， 

10, 

11 ， 

13 

9 

9 

1, 

10, 

11, 

13 

2, 3, 5, 

12, 

14, 

15 

4, 

6, 

7, 

16 

8 

10 

2, 

9, 

12, 

14 

1, 4, 6, 

11, 

13, 

16 

3, 

5, 

8, 

15 

7 

11 

3, 

9, 

12, 

15 

1, 4, 7, 

10, 

13, 

16 

2, 

5, 

8, 

14 

6 

12 

4, 

10, 

11, 

16 

2, 3, 8, 

9, 

14, 

15 

1 ， 

6, 

7, 

13 

5 

13 

5, 

9, 

14, 

15 

1, 6, 7, 

10, 

11, 

16 

2, 

3, 

8, 

12 

4 

14 

6, 

10, 

13, 

16 

2, 5, 8, 

9, 

12, 

15 

1 ， 

4, 

7, 

11 

3 

15 

7, 

11, 

13, 

16 

3, 5, 8, 

9, 

12, 

14 

1 ， 

4, 

6, 

10 

2 

16 

8, 

12, 

14, 

15 

4, 6, 7, 

10, 

11 ， 

13 

2, 

3, 

5, 

9 

1 


4.6.11 Right-Angular PBIB(4) Designs 

The right-angular association scheme was introduced by Tharthare (1963) for t = 
2sq treatments. The treatments are arranged in q right angles with arms of length 
s, keeping the angular positions of the right angles blank. The association scheme 
is then as follows: Any two treatments on the same arm are first associates; any 
two treatments on different arms of the same right angle are second associates; 




152 


PARTIALLY BALANCED INCOMPLETE BLOCK DESIGNS 


any two treatments on the same (i.e., parallel) arm but in different right angles 
are third associates; any two treatments are fourth associates otherwise. 

It follows then that 

n\ = s — 1 H 2 = s ns = (q — 1)5 = 
and 

/0 1 0 0 0 \ 

1 s-2 00 0 

Pi = 0 0 ^ 0 0 

0 0 0 s(q - 1) 0 

、0 0 0 0 s(q — l)y 

/0 0 1 0 0 \ 

0 0 1 0 0 
P 2 = 1 s-1 0 0 0 

0 0 0 0 s(q - 1) 

、0 0 0 s(q — 1) 0 ) 

/0 0 0 1 0 \ 

000 s-l 0 

P 3 = 0 0 0 0 s 

1 s — l 0 s(q — 2) 0 

、0 0 s 0 s(q — l)y 

/0 0 0 0 1 \ 

000 0 s-l 

P 4 = 0 0 0 s 0 

0 0 5 0 s(q — 2) 

s — l 0 s(q — 2) 0 j 

Example 4.14 Let t = \2, q = ?>, s = 2. 

Treatments: 
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The association scheme follows: 


0th Associate 

1st Associates 

2nd Associates 

3rd Associates 

4th Associates 

1 

2 

3,4 

5 , 6 , 9 , 10 

7 , 8 , 11 , 12 

2 

1 

3,4 

5 , 6 , 9 , 10 

7 , 8 , 11 , 12 

3 

4 

1,2 

7 , 8 , 11 , 12 

5 , 6 , 9 , 10 

4 

3 

1,2 

7 , 8 , 11 , 12 

5 , 6 , 9 , 10 

5 

6 

7 , 8 

1 , 2 , 9 , 10 

3 , 4 , 11 , 12 

6 

5 

7 , 8 

1 , 2 , 9 , 10 

3 , 4 , 11 , 12 

7 

8 

5 , 6 

3 , 4 , 11 , 12 

1 , 2 , 9 , 10 

8 

7 

5 , 6 

3 , 4 , 11 , 12 

1 , 2 , 9 , 10 

9 

10 

11 , 12 

1 , 2 , 5 , 6 

3 , 4 , 7 , 8 

10 

9 

11 , 12 

1 , 2 , 5 , 6 

3 , 4 , 7 , 8 

11 

12 

9 , 10 

3 , 4 , 7 , 8 

1 , 2 , 5 , 6 

12 

11 

9 , 10 

3 , 4 , 7 , 8 

1 , 2 , 5 , 6 


□ 


Tharthare (1965) extended this association scheme to what he called the gen¬ 
eralized right-angular association scheme for t = pqs treatments with p > 2. To 
describe the association scheme succinctly, it is useful to denote the treatment 
by triplets (xi, X 2 , x^) with x\ = \,2,..., q \ X 2 = 1,2,..., p; = 1,2,..., s. 
Then, two treatments (x\,X 2 , X 3 ) and (^ 1 , y 2 , ^ 3 ) are said to be 


1st Associates: if x\ = yi,x2 = yi, ^ ^3 

2nd Associates: if x\ = X2 ^ yi 

3rd Associates: if x\ ^ y\,x 2 = y 2 


4th Associates: otherwise 


This, obviously, leads to = s — l, ri2 = (p — 1)^, = {q — l).y, 

=(q - i)(p - 1 )义 


4.6.12 Cyclic PBIB Designs 

Cyclic PBIB designs represent a rather large and flexible class of incomplete 
block designs. They were introduced by Kempthome (1953) and Zoellner and 
Kempthorne (1954) for blocks of size k = 2. Further developments are due to 
for example, David (1963, 1965), David and Wolock (1965), John (1966, 1969), 
and Wolock (1964). 

Let us denote the treatments by 0, 1, 2,— 1. The association scheme can 
then be defined as follows, where we need to distinguish between t even and 
t odd: 


t even: For a fixed treatment 0, the uih associates are (0 u,0 — u)mod 

= 1，2, ...， "2 — 1) and the t /2th associate is 汐 + "2 (mod t). Thus we have 
m = t/2 associate classes with n u =2{u = 1 , 2 ,..., t/2 — 1) and n t / 2 = 1 . 
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We draw the reader s attention to the cyclic development of the treatments in a 
given associate class, hence the name of the association scheme. The P matrices 
are as follows: 


4.6.13 Some Remarks 

We conclude this section with the following remarks: 

1. As we have mentioned earlier, the list of association schemes given here 
is not exhaustive. Other association schemes with m > 2 associate classes 
were given by for example, Roy (1953-1954), Raghavarao (I960) ， Oga- 
sawara (1965), Yamamoto, Fuji, and Hamada (1965), and Adhikary (1966, 
1967). In most cases they represent generalizations of existing association 
schemes. 


t odd.. The uth associates of 0 are (0 u,0 — u)moa t for u = 1, 2,— 

1)/2. Thus we have m = (t — 1)/2 associate classes with n u =2[u = 1,2,, 
(t~ D/2]. 

Example 4.15 Let t = 6. 

Oth Associate 1st Associates 2nd Associates 3rd Associates 


□ 


\\ // \\ / \\ // 
oolo oloo looo 

cu cu cucucuC.Ucu 

C3〕 IX n]u C3〕 nl.u 

oloo oolo oool 

/ - \ /- \ /-\ 


P 


2 


3 


3 4 5 0 12 


4 5 0 12 3 
2, 3, 4, 5, o’ 1, 


5 0 12 3 4 
1,2,3, 4, 5, o. 


0 12 3 4 5 
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2. We have chosen our list of association schemes because they lead to a 
rather large class of existing and practical PBIB designs [in particular the 
PBIB(2) designs and the cyclic PBIB designs are well documented (see 
Chapter 5)] and/or are particularly useful in the construction of systems 
of confounding for symmetrical and asymmetrical factorial designs (see 
Chapters 11 and 12). 

3. We emphasize that an association scheme does not constitute a PBIB 
design, nor does it lead automatically to a PBIB design. Methods of con¬ 
structing designs with a certain association scheme is the subject of Chapter 
5. It will turn out that certain association schemes are connected with certain 
construction methods. 


4.7 ESTIMATION OF p FOR PBIB(2) DESIGNS 

Recall from Sections 1.10 and 2.5 that the general problem with utilizing the 
combined intra- and interblock information is that of finding an estimator for p, 
p, such that 

var[r(p)] < var(0 (4.56) 

where t is the intrablock estimator for the estimable function c' x and t(p) is the 
combined estimator for the same function using p as the estimate of p. 

Shah (1964) has derived as estimator for p for a certain class of designs such 
that (4.56) holds. We shall state his result without proof and then apply it to the 
various types of PBIB(2) designs as in Section 4.6. 

4.7.1 Shah Estimator 

Let D\ be the class of proper, equireplicate, binary incomplete block designs for 
which the concordance matrix NN r has only one nonzero eigenvalue (other than 
rk). Let 0 denote this eigenvalue with multiplicity a = rank(iV^V ’）一 1. Further, 
define 

Z = c(l+ c){55(Z r I J) - (2T f - rr f )r} (4.57) 

where c = (rk/0) — 1 and all the other terms are as defined in earlier sections. 
We now state the following theorem. 

Theorem 4.1 Consider an incomplete block design belonging to the class 
D\. When 

Z rka 

if - > — 

MS^ 0 


0 


z 


rk — 0 \aMSE 


otherwise 
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with Z as given in (4.57), is used in the combined analysis, then for any treatment 
contrast c’t, 


var[r(p)] < var (0 


for all values of p provided that 


(a - A){n -t-b-l)>S (4.58) 


4.7.2 Application to PBIB(2) Designs 

We have already used the result of Theorem 4.1 in connection with BIB designs 
(see Section 2.5) that clearly belong to D\. We have mentioned earlier that among 
all PBIB designs the PBIB(2) designs are quite numerous and most useful from 
a practical point of view. As we have seen in Section 4.4.3，they have two 
distinct eigenvalues other than rk and expressions for these and their multiplicities 
are given in (4.31) and (4.32), respectively. Using these results we shall now 
investigate which PBIB(2) designs belong to the class D\ and give for them the 
corresponding condition (4.58). 

1. Group-Divisible Designs Using the explicit expressions for Pi and P 2 

as given in Section 4.6.1, we find that 

0\ = r — X\ a\ = t\ {t 2 — 1 ) 

汐 2 = r — 入 1 + 入 1 — 入 2 ) (X2 = t\ — \ 

In particular, we have, following Bose and Connor (1952), for 

a. Singular GD designs: r — 入 1 =0. Hence the designs belong to D\, with 
6 = O 2 and a = Condition (4.58) then becomes (t\ — 5)(n — b — t — 
1 )> 8 . 

b. Semiregular GD designs: r — h (入 1 — 入 2 ) = 0. Hence these 
designs also belong to D\, with 0 = 0\ and a = a\. Condition (4.58) 
can, therefore, be written as [t\ (t 2 — 1) — 4](n — b — t — l) > S. 

c. Regular GD designs: They do not belong to D\ since 沒 1 > 0 and O 2 > 0. 

2. Triangular Designs Using the results of Section 4.6.2 we obtain 

0\ = r {q — 4 ) 入 1 — (q — 3) 入 2 a\ = q — l 

沒 2 = r _ 2 入 1 + 入 2 c ^2 = — 3)/2 


They belong to D\ if either one of these eigenvalues is zero, which is 
possible. 
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3. Latin Square Type Designs For the L/-association scheme (see Section 
4.6.3) we have 

0\ = r _ (i _ g) ( 入 i _ 入 2 ) _ 入 2 c^i = _ 1) 

沒 2 = r _ /( 入 i _ 入 2 ) _ 入 2 o ?2 = (^ — 1 )(^ — / + 1 ) 

they belong to D\ if either 0\ or O 2 is zero, which is possible. 

4. Cyclic Designs Neither the expressions for the eigenvalues or their mul¬ 
tiplicities reduce to simple expressions. To check whether a given design 
belongs to D\, we need to work out (4.31) and check whether either 0\ or 
O 2 equals zero. 



CHAPTER 5 


Construction of Partially Balanced 
Incomplete Block Designs 


In the previous chapter we discussed several types of association schemes for PBIB 
designs with two or more associate classes. There exists a very large number of 
PBIB designs with these, and other, association schemes. But as we have pointed 
out earlier, the association schemes themselves do not constitute or generate the 
actual plan, that is, the assignment of treatment to blocks. To obtain such plans, 
different methods of construction have been developed and used. It is impossible 
to mention all of them as the literature in this area is immense. Rather, we shall 
discuss in some detail only a few methods with emphasis mainly on three types 
of PBIB designs, namely PBIB(2) designs, in particular, group-divisible PBIB(2) 
designs, cyclic PBIB designs, and EGD-PBIB designs. The first class is important 
because it constitutes the largest class of PBIB designs. The second class also 
represents a rather large class, and the designs are easy to construct and are widely 
applicable. The last class is important with respect to the construction of systems 
of confounding for asymmetrical factorial experiments (see Chapter 12). 

5.1 GROUP-DIVISIBLE PBIB(2) DESIGNS 

We shall give a brief discussion of the basic methods for constructing group- 
divisible designs. These methods are (i) duals of BIB designs, (ii) method of 
differences, (iii) finite geometries, and (iv) orthogonal arrays, as developed mainly 
by Bose, Shrikhande, and Bhattacharya (1953). 

5.1.1 Duals of BIB Designs 

Let iV be the incidence matrix of a design D with parameters t ， b ， k ， r. Then N’ 
denotes the incidence matrix of a design D\ with parameters t f , b f , k f , /, which 
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Treatments 

(0, 0) (0, 1) (0, 2) 

(1,0) (1，1) (1,2) 

( 2 , 0 ) ( 2 , 1 ) ( 2 , 2 ) 


Block 


has been obtained from D by interchanging treatments and blocks. This implies 
that D f has parameters t f = b,b f = t, k f = r,r f = k. The design D' is referred 
to as the dual of the design D (and, of course, vice versa). If the design D has 
a certain structure, such as being a BIB or PBIB design, then the dual design 
D f very often also has a particular recognizable structure. It is this relationship 
between D and D f that we shall now utilize to construct certain GD-PBIB(2) 
designs from existing BIB designs. Consider the following theorem. 

Theorem 5.1 Let D be a BIB design with parameters t = s 2 ,b = s(s + 1), 
k = s, r = s l, X = 1, where 5 is prime or prime power (see Sections 3.34 
and 18.7). Then the dual Z)’ is a GD-PBIB(2) design with parameters 〆 =(s + 
1)5, t\ = .? + I, t 2 = s, = s^, 众 ’ = 5" + 1, 〆 = 入 i = 0, 入 2 = 1. 

Proof As can be seen from the construction in Section 3.7, D is a resolvable 
(a = 1) BIB design, consisting of ^ + 1 replication groups with 夕 blocks each 
and containing each treatment once. Denote the blocks by Bji where j denotes 
the replication group (j = 1, 2 , ... ，夕 + 1) and i denotes the block within the 
jth replication group (£ = 1,2,, s). Now consider a treatment 0, say, and 
suppose it occurs in blocks B\^ l , B 2 i 2 ,..., Then in D r block 0 con¬ 

tains the treatments 2^2 , …， 5 + u i+\- Further, let O\,02,0 s -\ be the 
treatments that occur together with 0 in B\t x in D. Since {0, H … , ^-i} 
appear all in different blocks in the remaining 5 1 replication groups (since 入 =1 )， 
it follows then that in D' treatment \i\ occurs together exactly once with all 
ji{j = 2,..., s l; l = 1,2,..., s), but not at all with the s — l treatments 
\l(l li). This implies the association scheme 


11 

12 .. 

,. Is 

21 

22 .. 

,. 2s 

1 1 

5" + 1 2 .. 

,. s + 1 s 


where treatments in the same row are 1st associates and 2nd associates otherwise, 
which is, in fact, the GD association scheme, and since the GD association is 
unique (Shrikhande, 1952), the resulting design is the GD-PBIB(2) design. □ 

Example 5.1 Consider the BIB design D with t = 3 2 ,b = 12, k = 3, r = 
4, 入 = 1， given by the plan [denoting the treatments by (x\,X 2 ), jq, X 2 = 0, 1, 2]: 


、 ~/ \~/ \~s 

0 12 

2 , 2 , 2 , 

/_\ /IV ^_\ 

\—/ \—/ \ — y 

0 12 

11 11 11 

/V /IN /V. 

\—/ \l/ 

0 12 

0 , 0 , 0 , 

/IV r\ /IV 


12 3 12 3 
111 222 
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13, 21, 33, 43 
13, 22, 31, 42 
13, 23, 32, 41 


The reader can verify easily that this is indeed the plan for a GD-PBIB(2) 
design with the parameters as indicated. It is isomorphic to plan SR41 given 
by Clatworthy (1973). □ 

5.1.2 Method of Differences 

We denote and write the t = t\t 2 treatments now as follows: 

h 2 

(^1 — 1)/2 (5.1) 

that is, the (i + l)th group in the association scheme for the GD design consists 
of the treatments (/i, /2, … ， h 2 ) with i = 0, 1,— 1， these treatments being 
1st associates of each other. The treatments in the ith column are referred to as 
treatments of class t. 

The method of differences as described for BIB designs (Section 3.2) can now 
be extended for purposes of constructing GD-PBIB(2) designs as stated in the 
following theorem. 


0 i O 2 

h I 2 

(^l — l)i (h — 1)2 


The dual D is then given by 


Block Treatments 


(0,0) 

11 ， 

21 ， 

31, 

41 

(0,1) 

11 ， 

22 , 

32, 

43 

(0, 2) 

11 ， 

23’ 

33, 

42 

(1,0) 

12 , 

21 ， 

32, 

42 

(1 ，1 ) 

12 , 

22 , 

33, 

41 

(1 ，2 ) 

12 , 

23, 

31, 

43 


Block Treatments 


0 12 

2 , 2 , 2 , 

/IV /(\ /IV 


\ — \ \ — \ \ ― \ 

12 1 

2, 2, 1， 

/(V /IV /V- 

2 12 
1 ， o, o, 

\ v...\ v...\ 


\~y \~y \~s 

2 12 
25 25 15 

/IV /IV /IV 

\7 \—'\ 
12 1 

1, 0, 0, 

v\ ...\ ...\ 


\ ― / \—*\ \—*/ \—*/ \—*\ \—/ 

000 000 

o,12, 0, 1, 2, 

/l\ /V /(\ /IV r\ /IV 


12 3 12 3 
3 3 3 4 4 4 
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Let the initial blocks of size 4 be 

^10 = (0i, O2, li, I2) 

^20 = (0i, O 2 , 2i, 2i) 

^30 = (Oi, O 2 , 3i, 32) 

To verify conditions 2-4 of Theorem 5.2 we see that treatments of class 1 and 
2 occur r = 6 times. Further, the differences of type (1,1) are 

B\o- 0i — li = 6 li — 0i = 1 

B 20 ： 0i — 2i = 5 2\ — 0\ = 2 

B 30 : 0i — 3i = 4 3i — 0i = 3 


Theorem 5.2 Let it be possible to find s initial blocks 召 10, B 20 , ..., B s q 
such that 

i. Each block contains k treatments. 

ii. Treatments of class i{i = 1, 2, …，， 2) are represented r times among the 
s blocks. 

iii. Among the differences of type (£, i) arising from the s blocks, each 
nonzero residue mod t\ occurs the same number of times, 入 2 ， say, for 
all i = 1,2 ,..., ^2- 

iy. Among the differences of type (£, i r ) arising from the s blocks, each 
nonzero residue mod t\ occurs 入 2 times and 0 occurs 入 1 times, say, for 
all (£, f)(£, f = 1,2, 

Then, developing the initial blocks Bio, B 20 ,.. • ， B s q cyclically, mod t\ yields a 
GD-PBIB(2) design with parameters t = t\t 2 , b = t\s, k, r = /^/ 匕，入 1 ，入 2. 

The proof is quite obvious as it is patterned after that of Theorem 3.1. By 
looking at array (5.1), it is clear that through the cyclic development of the initial 
blocks, any two treatments in the same row occur 入 1 times together in the same 
block, and any two treatments not in the same row occur 入 2 times together in 
the same block. This then satisfies the association scheme for the GD-PBIB(2) 
design. 

We shall illustrate this with the following example. 

Example 5.2 Let t = 14, t\ =7,t2 = 2; that is, array (5.1) is 


2 2 2 2 2 2 2 

0 12 3 4 5 6 


lx IX IX IX lx lx 11 

0 12 3 4 5 6 
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and the same holds for differences of type (2, 2); similarly, differences of type 
(1, 2) are 


召 10: 

0i 

— O 2 = 0 

0i 

—I 2 三 6 

ii 

— 02 三 1 

h 

— ]_2 三 0 

忍 20: 

0i 

— O 2 = o 

0i 

— 22 = 5 

2i 

_ O 2 三 2 

2i 

_ 2 之三 0 

召 30: 

0i 

— O 2 = o 

0i 

— 32 = 4 

3i 

_ O 2 = 3 

3i 

_ 32 = 0 


and the same is true for differences of type (2, 1). Hence 入 i = 6, 入 2 = 1. The 
plan for the GD-PBIB(2) design with b = 21 blocks is then 


(Oi, O2, li, I2) 
(h, 12,2^22) 
(2i, 22, 3i ， 32) 
( 3 i, 32 , 4 i, 42) 
( 4 i, 42 , 5 i, 52 ) 
( 5 i, 52, 61,62) 
(61,62, Oi, O2) 


(Oi, O2, 2 i, 22) 
(li ， I 2 , 3i ， 32) 
(2i, 22, 4i ， 42) 

( 3 i, 32 , 5 i, 52 ) 
(4i,42, 61,62) 
( 5 i, 5 2 , 0 i, 0 2 ) 
(61,62, li, I2) 


(Oi, O2, 3 i, 32) 
(UA 2 A 1 A 2 ) 
(2i, 22, 5i, 52) 

( 3 i, 32, 61,62) 
( 4 i, 42, Oi, O2) 
( 5 i, 52, li, I2) 
(61,62, 2i, 22) 


which is the same as plan S13 of Clatworthy (1973). We note that this PBIB 
design is, of course, resolvable and that it lends itself to two-way elimination of 
heterogeneity in that each treatment occurs exactly three times in the first two 
and the last two positions of the 21 blocks. □ 


5.1.3 Finite Geometries 

We shall first consider a method of using a projective geometry, PG ( 欠， p n ), 
to construct GD-PBIB(2) designs. The result can then be applied similarly to a 
Euclidean geometry, EG(^T, p n ). (See Appendix B for details about projective 
and Euclidean geometries). The basic idea in both cases is to omit one point 
from a finite geometry and all the M-spaces containing that point. The remaining 
M-spaces are then taken as the blocks of a PBIB design. More specifically we 
have the following theorem. 

Theorem 5.3 Omitting one point from a PG(^T, p n ) and all the M-spaces 
containing that point, gives rise to a GD-PBIB(2) design, if one identifies the 
remaining points with the treatments and the remaining M-spaces with the blocks. 
Any two treatments are 1st associates if the line joining them goes through the 
omitted point, they are 2nd associates otherwise. 

Proof The PG(^T, p n ) has p n -\ - p 2n + • • • + p Kn points. Omitting one 
point leads to 

t = p >' + p 2n + ...+ P Kn 
=(1+ + ■ ■ ■ + P (K ~ l)n ) P n 

三 t\t2 (5.2) 
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treatments. In the PG(^T, p n ) there are ♦ (M ， K ， p n ) M-spaces, of which 
^(0, M, K, p n ) contain the omitted point. Hence there are 

b = 中 （ M ， K ， p n ) - <p(0, M, K, p n ) (5.3) 


M-spaces left that constitute the blocks, each M-space (block) containing 

k = i/(Q ， M ， p n ) =\+p n + p 2n + ... + p Mn (5.4) 

points (treatments). Since each point is contained in (p(0, M, K, p n ) M-spaces 
and since each line joining a given point and the omitted point is contained in 
cp(l, M, K, p n ) M-spaces, it follows that each retained point (treatment) occurs in 

r = <p(0 ， M ， K ， p n ) - (p(l,M,K,p n ) (5.5) 

retained M-spaces (blocks). Further 

«i = ^(0, \,p n )-2 = p n -\ = t 2 -\ (5.6) 

n 2 = t -m-l = (p n + p 2n + ... + p ” 

=(^i - 1)^2 (5.7) 

and 

= Q ， k 2 = (p(hM,K ， p n ) (5.8) 


Thus, we are obviously led to the association scheme for the GD-PBIB(2) design. 
Because of the uniqueness of the association scheme, the design constructed in 
this manner is a GD-PBIB design. □ 


As an illustration we consider the following example. 


Example 5.3 Let p = 3, K = 2,n = l. The points of the PG(2, 2) are given 
by triplets (u\, U 2 , M 3 ) with u\, U 2 , W 3 = 0, 1 except (M 1 M 2 M 3 ) = (000). Suppose 
we omit the point ( 100 ), then the remaining points are: 010 , 110 , 001 , 101 , 011 , 
111, that is, t = 6. The lines (i.e., M = 1) not passing through 100 constitute the 
blocks of the design and are given by 


001 

010 

Oil 

110 

001 

111 

010 

101 

111 

110 

101 

Oil 


(5.9) 


that is, b = = 3,r = 2. The association scheme is obtained by determining 

for each point which other points lie on the line going through it and the omitted 
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Inspection of the plan (5.9) shows that 入 i = 0, 入 2 = 1. With suitable labeling 
this is plan SR 18 of Clatworthy (1973). □ 

5.1.4 Orthogonal Arrays 

Orthogonal arrays (for a description see Appendix C) play an important role 
in the construction of experimental designs (see also Chapters 13 and 14). Of 
particular interest are orthogonal arrays of strength 2. This is the type we shall 
employ here to construct GD-PBIB designs. More specifically we shall employ 
orthogonal arrays OA[N, p, 2; 入 ]， where p is prime. Their relationship to the 
GD-PBIB(2) design is as follows: Replace any integer x appearing in the ith 
row of the array by the treatment (/ — l)p x. The /th row contains then the 
treatments 

(/ — 1 )/ 7 , (/ — 1 )/? + 1 ,— 1)/7 + /? — 1 

each occurring r = N/p times since each symbol occurs equally often in each 
row. The total number of treatments is t = Kp and we have t\ = K ， t 2 = p . The 
columns of this derived scheme form the blocks of the PBIB design, that is, 
there are 办 =TV blocks, each of size k = K. Treatments in the same row are 1st 
associates, that is, each treatment has n\ = p — l 1 st associates and since these 
treatments do not appear in any other row, it follows that 入 1 = 0. Treatments 
in different rows are 2nd associates, giving ri 2 = (K — \)p. Since we have an 
OA[b, k, p, 2; X], that is, each possible 2x1 column vector appearing X times, 
we have 入 2 = 入 . This is, of course, the association scheme of a GD-PBIB(2) 
design with the parameters as stated above. 


point. For example, the only point on the line going through (001) and (100) is 
obtained from (/z0 + vl, /x0 + vO, [i\ + vO) with /x = v = l, that is, the point 
(101). Doing this for all six points leads to the following association scheme: 


0th Associate 

1st Associates 

2nd Associates 

010 

110 

001 , 101 ， Oil, 111 

110 

010 

001 , 101 ， Oil, 111 

001 

101 

010, 110, Oil, 111 

101 

001 

010, 110, Oil, 111 

Oil 

111 

010 , 110, 001, 101 

111 

Oil 

010 , 110, 001, 101 


This is, of course, the GD-PBIB association scheme if we write the treatments 
in the following array: 


101 
11 11 11 

oil 

H 「3 1 —H 



CONSTRUCTION OF OTHER PBIB(2) DESIGNS 


165 


We illustrate this with the following example. 

Example 5.4 Consider the following OA[8, 4, 2, 2, 2]: 


A 


This leads to the derived GD-PBIB(2) design (with columns representing blocks): 


Block 
1 2 3 4 5 


6 7 8 


0 10 10 10 1 
22332233 
44445555 
67767667 


with f = 8, Z? = 8, /: = 4, r = 4, 入 i 
plan SR36 of Clatworthy (1973). 


0, 入 2 = 2. This design is isomorphic to 
" & □ 


For further specialized methods of constructing GD-PBIB(2) designs the reader 
is referred to Raghavarao (1971). An essentially complete list of existing plans 
for GD-PBIB(2) designs is given by Clatworthy (1973) together with references 
and methods concerning their construction. 


5.2 CONSTRUCTION OF OTHER PBIB(2) DESIGNS 

We shall mention here only a few methods but otherwise refer the reader to the 
pertinent literature. 

5.2.1 Triangular PBIB(2) Designs 

Recall that the number of treatments ist = q(q — 1)/2 and that for the association 
scheme the treatments are arranged in a triangular array and its mirror image 
(see Section 4.6.2). One method of constructing triangular designs can then be 
described as follows. 

If we let each row of the association scheme be a block, then the resulting 
design is a triangular PBIB design with parameters: 


q(q — 1)/2 b = q k = q — \ r = 2 


入 1 


入 2 = 0 


Example 5.5 Let r = 6 = 4 x The association scheme is 



n—0 1i cj 〕 

cucu 

C33 C3〕 IX 

C33 C3〕 

c — 3 11 c — 3 11 

li Cj3 Cj 〕 1A 

o o o o 

I_ 
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The design then is given by 


1 

2 

Block 

3 

4 


1 2 3 

1 4 5 

2 4 6 

3 5 6 


that is, /? = 4, /: = 3, r = 2, 人 i = 1 ，人 2 = 0. D 

Other methods given by Shrikhande (1960, 1965) and Chang, Liu, and Liu 
(1965) are based on the existence of certain BIB designs and considering, for 
example, the dual of the BIB design as omitting certain blocks from the BIB 
design. Still other methods are given by Clatworthy (1955, 1956), Masuyama 
(1965), and Ray-Chaudhuri (1965). A listing of practically useful designs is 
given by Clatworthy (1973). 


5.2.2 Latin Square PBIB(2) Designs 

Similar to the procedure described in the previous section, the following method 
of constructing L 2 -PBIB designs is closely connected to the square treatment 
array for the /^-association scheme. 

We have t = q 2 , and suppose q is prime or prime power. We know that 
there exist then q — I mutually orthogonal Latin squares (MOLS) of order 
q (see Section 1.10.6.2). The languages for these q — I MOLS are then 
superimposed on the treatment array, that is, each treatment has associated 
with it q — l letters, one from each of the q — I MOLS. Collecting the 
treatments that have the same letter from each of the q — I languages and 
calling those sets blocks, leads to an L 2 -PBIB design with the following 
parameters: 

t = b = q{q — 1) k = q r = q — 1 人 1=0 人 2 = 1 

Example 5.6 Let t = 16, q = A. The 4x4 association scheme with the 
superimposed three MOLS is as follows: 


1 

2 

3 

4 

Aa I 

B ll 

Cy III 

D8 W 

5 

6 

7 

8 

By IV 

A 8 III 

Dali 

c p l 

9 

10 

11 

12 

C 8 11 

Dy I 

IV 

Ba III 

13 

14 

15 

16 

III 

Cot IV 

B8 I 

Ay II 
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1(A) 
2(B) 
3(C) 
4(D) 
5 ⑻ 
6 ⑹ 
7(y) 
8(5) 
9 ⑴ 
10 ( 11 ) 
11 ( 111 ) 
12(IV) 


The blocks are then 

Block Treatments 


It is, of course, obvious from the method of construction that this PBIB design 
is a resolvable design with the blocks from each language forming a complete 
replicate. □ 

Other methods of construction, some based on the existence of certain BIB 
designs, have been given by for example, Bose, Clatworthy, and Shrikhande 
(1954), Chang and Liu (1964), Clatworthy (1955, 1956, 1967), and, as lattice 
designs, by Yates (1936b) (see Section 18.7). A list of these designs can be 
found in Clatworthy (1973). 

5.3 CYCLIC PBIB DESIGNS 

We shall now turn to the rather large class of cyclic PBIB designs we 
introduced in Section 4.6.12. These designs are quite useful since (1) they are 
easy to construct, namely as the name suggests through cyclic development 
of initial blocks, (2) they exist for various combinations of design parameters, 
and (3) they are easy to analyze, that is, their analytical structure is easy to 
derive. 

5.3.1 Construction of Cyclic Designs 

As mentioned earlier, the construction of cyclic designs is based on the cyclic 
development of a set of initial blocks. We can distinguish basically between four 
types of cyclic designs according to whether the number of blocks b is (1) b = t, 
(2) b = st, (3) b = t/d, or (4) b = t (s + 1/d), where d is a divisor of t. It 
is then obvious that 

For (1) we need one initial block of size k\ 

For (2) we need 5 distinct (nonisomorphic) initial blocks of size k; 


654343655634 
11 11 11 11 11 11 11 11 11 11 11 11 

12 9 0 2 10 9 0 9 2 1 
11 11 11 11 11 11 11 11 11 

6, 5, 8, 7, 7, 8,5,6’ 8, 7’ 6’ 5, 
1， 2, 3, 4, 1， 2, 3, 4， 1, 2, 3, 4, 
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with 人 i = 2, A，2 = 1, 人 3 = 0, and 



(0 

1 

0 

0\ 



1 

0 

1 

0 

Pl = 

1 = 

0 

1 

0 

1 


lo 

0 

1 

oj 



0 10 、 /o 0 0 l\ 
1 0 1 0 0 2 0 
010 3 = 0 2 0 0 
1 0 0/ \l o o oj 


For (3) we need one initial block of size k such that, if cyclically developed, 
block b-\-l=t/d-\- \ is the same as the initial block and each treatment 
is replicated r times. This means that if the initial block contains treat¬ 
ment i, it also must contain treatments i + /?,/ + 2b, + (J — \)b. This 
implies that d is also a divisor of k, say k/d = k f . Then any k f treatments 
can be chosen and the remaining treatments in the initial block 
are determined; 

For (4) we combine initial blocks from type (1) if ^ = 1 or (2) if 5 1 2 3 4 5 6 > 1 with 
an initial block from type (3). 

The choice of initial blocks for (1) ， (2), and hence (4) above is quite arbitrary 
since any choice will lead to an appropriate design. There is, however, one other 
consideration that plays an important role in choosing initial blocks and that is 
the efficiency E of the resulting design. Efficiency here is defined in terms of 
the average variance of all treatment comparisons (see Section 1.10)，and high 
efficiency is closely related to a small number of associate classes. Whereas the 
number of associate classes for a cyclic design is, in general, m = t/2 for t even 
and m = (t — 1)/2 for t odd, this number can sometimes be reduced by a proper 
choice of initial blocks, in some cases even to m = 1, that is, BIB designs, or 
m = 2, that is, PBIB(2) designs. For example, consider t = 6, k = 3, r = 3, b = 
6 : If the initial block is (1,2, 3)，then the plan is 

1 2 3 

2 3 4 

3 4 5 

4 5 6 

5 6 1 

6 1 2 

and by inspection we can establish the following association scheme: 

0th Associate 1st Associates 2nd Associates 3rd Associates 


4 


3 4 5 6 


4 

2, 


2 , 3 , 4 , 5 , 6 , 


12 3 4 5 6 
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with 入 i = 1 , 入 2 = 2 , and 

/O 10 、 /O 0 1 \ 

Pi = 1 2 1 P 2 = 0 4 0 

\0 1 0 / \l 0 0 / 

Hence we have in fact a PBIB(2) design. For the first design we obtain E = 
0.743 [see (5.14)], whereas for the second design we find E = 0.784. Hence the 
second design is slightly preferable from an efficiency point of view, and this 
design is therefore listed by John, Wolock, and David (1972). One can convince 
oneself that no better cyclic design for this combination of parameters exists. 

The association scheme for the second design is, indeed, the association 
scheme for a cyclic PBIB(2) design as discussed in Section 4.6.4, except that we 
have to relabel the treatments as 0, 1 ， … ， 5. We then have d\ = \, d 2 = 2, = 

4, d 4 = 5, e\ = 3, a = 2, )6 = 4. 

An extensive list of initial blocks for cyclic designs for various parameters with 
6 < ^ < 30, ^ < 10, r < 10, and fractional cyclic designs for 10 < ? < 60, 3 < 
^ < 10 is given by John, Wolock, and David (1972). A special group in this 
collection of plans are those with k = 2, which are often referred to as paired 
comparison designs. 

5.3.2 Analysis of Cyclic Designs 

The general methods of analyzing incomplete block designs (see Chapter 1) 
apply, of course, also to the cyclic designs just described, or even more specif¬ 
ically, the methods of analyzing BIB designs (where appropriate) or PBIB(m) 


Hence we have a PBIB(3) design. If, on the other hand, the initial block is 
(1, 2, 4), then the resulting plan is 

1 2 4 

2 3 5 

3 4 6 

4 5 1 

5 6 2 

6 1 3 

and, by inspection, we can speculate on the association scheme as being 

0th Associate 1st Associates 2nd Associates 
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designs can be used. However, as pointed out earlier, for many cyclic designs the 
number of associate classes, m, is rather large and hence the system of equations 
(4.34) becomes rather large. Instead, another method of solving the RNE 



Q 


for cyclic designs can be used. This method is interesting in itself as it makes 
use of the particular construction process that leads to a circulant matrix C and 
hence to a circulant matrix C_, the elements of which can be written out explicitly 
(Kempthome, 1953). 

Let us denote the first row of C by (ci, C 2 ,..., c t ). The second row is then 
obtained by shifting each element of the first row one position to the right (in a 
circulant manner), and so forth so that C is determined entirely by its first row. 
The same is true for C _ , so all we need to know in order to solve the RNE 
and to obtain expressions for variances of estimable functions X f r, is the first 
row of C—, which we denote by c 1 ， c 2 ,Let dj(j = 1 ， 2,, f) denote 
the eigenvalues of C where (Kempthorne, 1953) 

t 

dj = Y, - DO' - 1)0 (5.10) 


and 0 = 2n/t. Note that d\ = 0; hence di, (h, … ， d t are the nonzero eigenvalues 
of C. Then, as shown by Kempthome (1953), the elements c l {i = 1,2,... ,t) 
are given as 


c 



c 



t 

E 


cos(; - 1 )( / - 1)0 



(i = 2, 3,..., t) 


(5.11) 


Expressions (5.10) and (5.11) can actually be simplified somewhat. Because of 
the construction of the designs and the resulting association scheme, we have for 
the in NN f _ ^ 


入 12 = 入 U ， 入 13 = 入 1 ，卜 1 ， 

I 入 U/2 = 入 i ，， /2+2 for t even 

入 i ， (r+i)/2 = 入 i ， (?+3)/2 for t odd 

and hence we have for C 

k- 1 
c\ = r — — 
k 

_ 入 12 


C2 = C t = 


k 
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C3 = 

= C t -\ = 

k 


C4 = 

=Q-2 = 

— 入 14 


k 


c (t+l)/2 = 

= 印 +3)/2 

——人 l,(f+l)/2 

一 k 

for t odd 

c (t+2)/2 = 

_ —人 l,(r+2)/2 

k 

for t even 


As a consequence, also the maximum number of different c l values is (t + 1)/2 
for t odd or (t + 2)/2 for t even. With the c l values given in (5.11) we can find 
the variances of treatment comparisons in general via comparisons with treatment 
1 as follows: 


var ( 节 -tff) = var(fi — 务 _,_ +1 ) 

= 2(c* - c ir - i+l )a]- for i' > i (5.12) 

This is, of course, a consequence of the circulant form of C~. The average 
variance of all such treatment comparisons then is 

av. var = --- var(r/ — %) 

to - 1 ) ^ 

ii 

=^ var(ri -f/) 


and using (5.12), 


av. var : 


.w 




(c L -C l )o ； 



It 



1 〜 2 


(5.13) 


since EL〆 = o as can be verified directly by using (5.11). From (5.12) it 
follows then that the efficiency factor for the treatment comparison between the 
ith and the i f th treatment is 


Eu，= 


2 o e/ r = 

2(c l - /+1 )of 


r(c l — c"— /+1 ) 



172 


CONSTRUCTION OF PARTIALLY BALANCED INCOMPLETE BLOCK DESIGNS 


and from (5.13) that the overall efficiency factor is 

E _ 2 °e/ r _ ? ~ 1 

2tc x a}/(t — 1) rtc 1 


(5.14) 


5.4 KRONECKER PRODUCT DESIGNS 

In previous sections we have used the fact that incomplete block designs can 
sometimes be generated from some other existing incomplete block design, for 
example, the residual design of a BIB design is also a BIB design, the duals 
of certain BIB designs are PBIB(2) designs. In this section we shall use another 
widely applicable and useful technique to generate new incomplete block designs 
from existing designs. This method and hence the resulting designs are referred 
to as Kronecker product designs. These designs were first introduced by Vartak 
(1955) and later put on a more formal basis by Surendran (1968). 

5.4.1 Definition of Kronecker Product Designs 

Suppose we have two PBIB designs, D\ and D 2 , with m\ and m 2 associate 
classes, respectively [in what follows we consider a BIB design as a PBIB(l) 
design] defined by their respective incidence matrices N\ and N 2 with the fol¬ 
lowing parameters: 

D\\ ti,ri,ki,bi,n u ,X u P k = (p k uv ) (u,v,k = 0, 

(5.15) 

Z) 2 : py = (q f gh ) (f, g，h = 0,1 ， … ， m 2 ) 

Consider now the Kronecker product 

N = NixN 2 (5.16) 

where N is obtained by multiplying each element in N\ by the matrix N 2 - 
It is immediately obvious that N is an incidence matrix of dimension t x b 
with t = t\t 2 and b = b\b 2 . Furthermore, each row of N contains r = r\Y 2 unity 
elements and each column contains k = k'ki unity elements. Hence the design 
D ， defined by its incidence matrix (5.16), is an incomplete block design with t 
treatments, b blocks, r replicates per treatment, and k units per block. It remains 
for us to show that this Kronecker product design is indeed a PBIB design and 
to establish its association scheme and the remaining parameters. 

5.4.2 Properties of Kronecker Product Designs 

Let us denote the treatments of D\ by the vector 0 f = (^1, O 2 ,, 0 tl ) and the 
treatments of D 2 by the vector 0’ = (0i, 02,… ，也 2 ). The treatments for design 
D can then be defined by what Kurkjian and Zelen (1962, 1963) have called the 
symbolic direct product between 0 and 0. 
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Definition 5.1 Let 0 r = (H ... ， and = (0i ， 02 , ... ，也 ） be two 
arrays of size q and 5, respectively. Then a new array x of size ^ ^ is created 
by the symbolic direct product (SDP) of 0 and 0 as follows: 


XU 

X\2 


x = 0 <Si<l> = x\ s 
X2\ 


(5.17) 


Xqs 


□ 


It is convenient to enumerate the rows of the matrix N in (5.16) in the same way 
as the array in (5.17). We shall now prove the following theorem. 

Theorem 5.4 Let D\ and be two PBIB designs with parameters as given 
in (5.15). Then the Kronecker product design D as defined by its incidence 
matrix N = N\ x N 2 is a PBIB design with m = l)(m2 +1)—1 asso¬ 

ciate classes and the following association scheme: If treatments 0 V and 0 V > are 
wth associates in D\ and if treatments and cj)〆 are gth associates in D 2 
then the treatments x V[l and x v ，〆 are said to be (u : g)th associates in D with 
(u : g) = (0 : 0), (0 : 1 )， … ， （0 : m 2 )，（1 : 0)， …， （mi : m 2 ). 

Proof We shall prove the theorem by establishing that the concordance matrix 
and the association matrices satisfy certain conditions as established by Bose and 
Mesner (1959) and as used in Chapter 4. 

Let B u (u = 0, 1,..., mi) and = 0, 1,. .. ， m 2 ) be the association matri¬ 
ces for designs D\ and D 2 , respectively. We then have 

A^iiV ； = (5.18) 

u=0 

and 

m 2 

N 2 N , 2 = Y / X*B* (5.19) 

g=0 

Then, using (5.16) ，（ 5.18)，and (5.19)，we obtain 

NN r = (Ni x ^V 2 )(iVi x N 2 y 
=(iVi^V；) x (iV 2 iV；) 
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m\ 


m 2 


I x I 


、 M=0 

m\ m 2 


U=° 


{^ u x B 

u=0 g=0 


Now let 


^u'q ― ^u x B 


， M :茗 


(5.20) 


(5.21) 


It can be seen easily that the B u:g are association matrices since 
(i) B 0 ：o = BxB ^ 0 = I t 

mi m 2 

⑼ X] = Bu x 


u=0 g=0 


(iii) B ll ： f ,B V[h 


M g 


= (!>») X (!>;) = W X 0 t2 0 ； 2 =^r0 ； 

[b u x B；) (B v x Bl) = {B U B V ) x 


'mi 


m 2 


x ( Ty g h B y 

/ =0 


\k=0 / 

m\ m 2 

H Puv^gh ( B k X B 

k=0 / =0 
m\ m 2 

p^ih B k：f 

k=0 / =0 
m\ m 2 

a Puiv：h B k-.f 

k=0 / =0 


with 


and 


k:f 

P u:g,v:h 



k:f k:f 

P u:g,v:h — P u:h,u:g 

Further, the association scheme defined by the association matrices (5.21) is 
indeed the same as that stated in the theorem, which is immediately obvious 
from the way the treatments of N have been enumerated in (5.17). 

This completes the proof of the theorem. 


□ 
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By way or proving the theorem, we also obtain the remaining parameters or 
the Kronecker product design D as 


l u：f 


n u n 


f 


K：f = 入 A} 

p k'f = {p k u fg,v\h) 

= (p k uv - P f gh) 

=Pk X P*f 

with (w : /) = (0 : 0), (0 : 1),. • •, (0 : m 2 ), (1, 0),..., (m 1 : m 2 ). 

As a simple application we consider the following example. 

Example 5.7 Let D\ be a BIB design with parameters t\ = A, b\ 
3, r\ =3(= 入 0 ) ， 入 l = 2, and incidence matrix 


4, k x 


Ni 


(\ 

1 

1 

V 。 


0 、 

1 

1 

1 


and let D 2 be a BIB design with 
and incidence matrix 


3, b 2 = 3, 众 2 = 2, r〗= 2(= Aq), A* = 1, 



Then D has the parameters t = 12, b 
入 1:0 = 4 , 入 i:i =2, and incidence matrix 


I 2 ,k = 6 ,r = 6(= 入 0 : 0 )， 入 0:1 = 3; 


= In 万 1 = ~ 

^0 = lt 2 B\ = 0 t2 J； 2 - I tl 

as the association matrices for D\ and D 2 , respectively, the association matrices 
for D are 

^0:0 = Itl X lt 2 = 

Bo-a = It, x (3t 2 0； 2 - In) = In x ^t 2 0； 2 - I, 


2 2 2 

OJVJVJV 

2 2 2 

^ o ^ ^ 

2 2 2 

ivivoiv 

2 2 2 丨 

JVJVJVO 

//m\ 

II 



0:0 associates if 
0:1 associates if 
1:0 associates if 
1:1 associates if 
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^i：o = (3^3^-^)x7^ = 0 tl 0 f h x I t2 - I t 

= ( li % _ ’，1) X (心义 _ ’，2) 

= Ot^t ~ ^h^t\ X X 

Obviously 

Bo：o + Bo：\ + B\：o + B\ : \ = 0 t 0 f t 

Any treatment in D is denoted, according to (5.17), by (/, j) where i = 
1, 2, 3, 4, j = 1, 2, 3, and these treatments are arranged in the order as given 
by (5.17). The association scheme for this PBIB(3) design can then be expressed 
in two ways, both of which are useful: 

1. Any two treatments (/, /) and (/’，）’）are 


2. If the treatments are arranged in the 4 x 3 array 


then each treatment is the 0:0th associate of itself, and any two treatments 
are 

0:1 associates if they are in the same row 

1:0 associates if they are in the same column 

1:1 associates if they are in different rows and columns 

This association scheme has been referred to as the extended group-divisible 
association scheme for an EGD(3)-PBIB design by Hinkelmann and Kemp- 
thorne (1963) using representation 1 above, and as the rectangular associ¬ 
ation scheme by Vartak (1955) using representation 2 (see Sections 4.6.9 
and 4.6.5). 


II 


• It • i • Tt • i 

ll ll 


、 ~s \~s \~s \~s 

3 3 3 3 
1, 2， 3, 4, 

/ V /■_\ /V V _\ 

\—^ \—^ 

2 2 2 2 
1，2，3，4, 

/V /IV v—\ v—\ 

\—/ \)/ \)/ \)y 

11 11 11 11 

1, 2, 3, 4, 

/l\ /l\ /l\ /l\ 


Finally, the P matrices for the design D are obtained from those for design D\ ： 
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and design 


n 


0、 

,0 2j 


Pi 


r 

、i i ‘ 


as 


尸 o:o 


0 


r v •/丄 i 

0 

2 


1 1 

3 

尸 0:1 = 

0 3 

0 

L i 


0 


6 / 


0\ 



0 


0 

尸 1:0 = 

0 2 

尸 i:i = 

1 1 


1 0 2 0 


0 1 0 2 


\0 2 0 4/ 


\1 


0 1\ 


2 2 / 


□ 


The method of constructing PBIB designs as Kronecker product designs can, 
obviously, be extended to Kronecker products of incidence matrices for three or 
more existing PBIB designs, for example, 

N = N\ x N 2 

and the reader should have no difficulties working out the parameters of the result¬ 
ing PBIB design with (m\ + l)(m 2 + 1)(^3 + 1) — 1 associate classes, where 
mi, m 2 , m 3 are the numbers of associate classes of the component designs. 


5.4.3 Usefulness of Kronecker Product Designs 

It is, of course, clear that the usefulness of Kronecker product designs is somewhat 
limited as even for two component designs, that is, for N = N\ x N 2 they may 
require rather large block sizes and/or large numbers of replications depending on 
the corresponding parameters for the underlying designs N\ and N 2 - In general, 
therefore, only those designs N\ and N 2 with moderate values for k and/or r 
will be useful in this connection. 

Also, the number of associate classes for Kronecker product PBIB designs may 
be quite large since the maximum number of associate classes for N = N\ x N 2 
is m\m 2 + m\ + m 2 . Vartak (1955) and Kageyama (1972) have shown, however, 
that under certain conditions this number can be reduced considerably, thereby 
actually inducing a different association scheme. Kageyama (1972), for example, 
proves that a necessary and sufficient condition for an EGD-(2 y — 1)-PBIB design 
(see Section 4.6.9) based on the v-fold Kronecker product of BIB designs to 
reduce to PBIB(v) designs having the hypercubic association scheme (see Section 
4.6.10) is that the v BIB designs have the same number of treatments and the same 
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block size. This not only makes Kronecker product PBIB designs more attractive 
from a practical point of view but it also provides methods of constructing PBIB 
designs with different association schemes. 

5.5 EXTENDED GROUP-DIVISIBLE PBIB DESIGNS 


We have mentioned earlier that the EGD-PBIB designs play a major role in 
connection with systems of confounding for asymmetrical factorial experiments 
and balanced factorial experiments (see Chapter 12). Knowing how to construct 
EGD-PBIB designs therefore means knowing how to construct such systems 
of confounding. It is for this reason that we shall present a few methods of 
constructing EGD-PBIB designs. 

5.5.1 EGD-PBIB Designs as Kronecker Product Designs 

As illustrated already by Example 5.7, the EGD-PBIB(3) design can be con¬ 
structed as the Kronecker product of two BIB designs. It is then quite obvious 
that a v-fold Kronecker product of BIB designs will lead to an EGD-PBIB(2 v — 1) 
design. More formally, we state the following theorem. 

Theorem 5.5 Let there be v designs Di with parameters ^, bi , k [, r/ ，入 (/) and 
incidence matrix Ni(i = 1,2,..., v), respectively. Then the Kronecker product 
design D as given by its incidence matrix 

N = N\xN2 乂 ， ..xN v 

is an EGD/(2 V — 1)-PBIB design with parameters 

V V V 

^ = n^ ^=w^i ^ 

i=\ i=l i=l 

i=l 

where the power y/ = 0 or 1. 

Proof For the proof we simply refer to the definition of the Kronecker prod¬ 
uct design (Section 5.4) and the definition of the EGD/(2 y — 1)-PBIB design 
(Section 4.6.9). □ 

5.5.2 Method of Balanced Arrays 

The second method of constructing EGD-PBIB designs is due to Aggarwal 
(1974). It is based on balanced arrays. Before we give the corresponding theorem 
we need first the following definition. 


V 

r = ]~[ 

i=\ 
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Definition 5.2 Let a! = (a\,a 2 , ..., a q ) and )8 ’ = d 芦 2, , Pq) be two 
arrays of q elements, then the symbolic inner product (SIP), a' O , of these 
arrays is the array defined as 

a r O = (a\ P\ ， ot2P2,... ， O^qPq) n 


We then state the following theorem. 


Theorem 5.6 The existence of an EGD/(2 V — 1)-PBIB design with param¬ 
eters t = ]~~[ ? _^ ti, b = U{ti — 1), k = ti, r = ]~I/ = 2 (G — 1)，and 

_ J 1 if yi = y 2 = ... = y v = l 

nY2--Yv |q otherwise 

where t\ < t[(i = 2, 3,... ， v)，is implied by the existence of v — 1 balanced 
arrays BA/[^(r/ — 1), ti, 2](i = 2,3,..., v), that is, with Ni = ti {ti — 1) 
assemblies, t\ constraints, strength 2 , ti levels, and for each i 


入 ( 又 1, X2)= 



if X\ = X2 

otherwise 


[for a definition of balanced array (BA) see Appendix C]. 


Proof This proof actually gives the construction of the EGD-PBIB design, 
assuming that the BA/ exist. Denote the columns of BA/ by an, aa ,..., a/ ， ( 々 _i) 
and the elements in BA/ by 0, 1，…， h — 1. Then construct successively the 
following EGD-PBIB designs: 

1. Take the SIP of the array [0, 1, 2, ... ， ⑺ 一 1)] and each a f 2 j[j = 
1, 2,… ，， 2(，2 — 1)] of BA 2 . If each SIP is taken as a block with treatments 
of the form (x\X 2 ), then this yields an EGD(3)-PBIB design with the fol¬ 
lowing parameters: t = t\t 2 , b = tiih — l), k = t\, r = t 2 — l, noi = t:— 
1， nio = t\ — n\\ = (t\ — 1)(>2 — 1), 入 oi = 0, 入 io = 0 ，入 ii = 1. This 

can be verified easily by noting that (a) each element of BA 2 must occur 
t 2 ~ I times in each row of BA 2 , hence each element in (0, l,..., t\ — l) 
is combined with each element of (0, 1 ， • • •， h — 1), implying r = t 2 _ U 
(b) each SIP forms a block of size k = t\ and hence 办 =— 1); (c) 
because of the BA property concerning the X(x\,X 2 ), no two pairs of ele¬ 
ments, that is, no two treatments (/, j) and (/’，/), say, occur together in 
the same block if i = i' or j = j\ implying 入⑴ = 入 ⑴ = 0, and two pairs 
of elements (/, j) and (/’ ， j r ) occur together exactly once if i ♦ i’ and 
j ^ /, implying 入 11 = 1. This establishes the EGD(3)-PBIB property of 
this design. We denote the blocks, that is, SIP arrays, of this EGD-PBIB 
design by b' 2V b' 22 ,..., b' 2 h(t2 _ ly 
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The SIP of (0, 1, 2) with the columns of BA 2 yields 

b' lx = ( 00 , 12 , 21 ) 

* 2 2 = ( 01 , 10 , 22 ) 

*23 = (02, 11,20) 

^ 24 = ( 00 , 11 , 22 ) 
b f 25 = ( 01 , 12 , 20 ) 
b f 26 = ( 02 , 10 , 21 ) 

The SIP of the by's with the columns of BA 3 yields the final plan with 72 
blocks of size 3 as given in Table 5.1. □ 


5.5.3 Direct Method 

The methods described in Sections 5.5.1 and 5.5.2 depend both on the avail¬ 
ability of appropriate BIB designs or BAs, respectively. The method to be 
described in this section, which is due to Chang and Hinkelmann (1987), is 
a direct method in the sense that it is based ab initio on simple combinatorial 
operations. 


2. Then take the SIP of the = 1,2,..., — 1)] with the = 

1, 2,..., ^ 3(^3 — 1)] of BA 3 . This leads to an EGD/7-PBIB design 
with parameters t = ht 2 t 3 , b = — — l), k = t\, r = (t 2 — 1 ) 

03 _ 1)， 入 001 = 入 010 = 入 Oil = 入 100 = 入 101 = 入 lio = 0，入 111 = 1，^100 = 

t\ — 1, «oio = h — 1, nooi = G — 1, ^110 = (^1 — 1)02 — 1 )， ^ioi = 
0l — 1)03 — 1), noil = (t 2 — 1)03 — 1) ， n\u = (t\ — 1)02 — 1)03 — 1). 
This can be verified by repeating the arguments given in item 1. 

3. Continue the process of forming the SIP of the blocks of the EGD/(2 q — 1)- 

PBIB design with the columns of BA^ + i until q = v — The resulting 
design is then an EGD/(2 y — 1)-PBIB design with parameters as stated in 
the theorem. □ 

Example 5.8 Let ? = 36 = 3x3x4, that is, t\ = 3, ^ = 3, = 4. The two 

required BAs are (see Appendix C for construction) 

/ 0 1 2 0 1 2 \ 

BA 2 =201120 

l 1 2 0 2 0 1 ) 



3 0 2 

2 13 
12 0 
0 3 1 

3 10 
2 0 1 
13 2 
0 2 3 
3 2 1 
2 3 0 
10 3 
0 12 



B 
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Table 5.1 EGD-PBIB Design for 
t = 36, b = 72, A: = 3, and r = 6 


(000, 121, 212) 
(001, 120, 213) 
(002, 123, 210) 
(003, 122, 211) 
(000, 122, 213) 
(001, 123, 212) 
(002, 120, 211) 
(003, 121 ， 210) 
(000, 123, 211) 
(001, 122, 210) 
(002, 121, 213) 
(003, 120, 212) 

(020, 111, 202) 
(021 ， 110, 203) 
(022, 113, 200) 
(023, 112, 201) 
(020, 112, 203) 
(021, 113, 202) 
(022, 110, 201) 
(023, 111, 200) 
(020, 113, 201) 
(021, 112, 200) 
(022, 111, 203) 
(023, 110, 202) 

( 010 , 121 , 202 ) 
(Oil, 120, 203) 
(012, 123, 200) 
(013, 122, 201) 
(010, 122, 203) 
(Oil, 123, 202) 
( 012 , 120 , 201 ) 
(013, 121, 200) 
(010, 123, 201) 
(Oil, 122, 200) 
(012, 121, 203) 
(013, 120, 202) 


( 010 , 101 , 222 ) 
(Oil, 100, 223) 
(012, 103, 220) 
(013, 102, 221) 
(010, 102, 223) 
(Oil, 103, 222) 
( 012 , 100 , 221 ) 
(013, 101, 220) 
(010, 103, 221) 
(Oil, 102, 220) 
(012, 101, 223) 
(013, 100, 222) 

( 000 , 111 , 222 ) 
(001, 110, 223) 
(002, 113, 220) 
(003, 112, 221) 
(000, 112, 223) 
(001, 113, 222) 
(002, 110, 221) 
(003, 111, 220) 
(000, 113, 221) 
(001, 112, 220) 
(002, 111, 223) 
(003, 110, 222) 

( 020 , 101 , 212 ) 
(021, 100, 213) 
(022, 103, 210) 
(023, 102, 211) 
(020, 102, 213) 
(021, 103, 212) 
( 022 , 100 , 211 ) 
(023, 101, 210) 
(020, 103, 211) 
( 021 , 102 , 210 ) 
(022, 101, 213) 
(023, 100, 212) 


Before we state the main result, we introduce, for ease of notation, the fol¬ 
lowing definition. 


Definition 53 For x and y intergers and x > y, let 

少 = 占 = 咖― 1 ) … (n + 1) 


(5.22) 

□ 
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We then have the following theorem. 


Theorem 5.7 For t = Yl^=i U treatments with t\ < t 2 < •― < t v there exists 
an EGD/(2 y — 1)-PBIB design with parameters 

， = n" 办 =n，! (ri) 
i=l i=2 

v 

r = 「[⑺ ■- 1)(， 1_1) k = h 

i=2 

v 

HO, 0,… ’ 0) = r 入 (1，• • ” 1) = 汍 - 1 2 3 ) ㈣) 

i=2 

all other X(y) = 0 


The proof is rather elementary but quite lengthy. We shall not go through all 
the details here and instead refer the reader to Chang and Hinkelmann (1987). 
Just as for Theorem 5.6, however, the proof is based on the actual construction 
of the design, and the individual steps of the method will be described now. To 
do so, it is useful to introduce some notation. 

Let (x\,X 2 , … ， x v ) denote a treatment with x/ 6 {l, 2,..., ti} for i = 
1, 2,..., y and is called the ith component of the treatment (see Section 
4.6.9). We then define a matrix 




p ⑵ 

P = 

p(h) 


(5.23) 


of dimension t/j x t v with the submatrix of dimension x 

t v . (In what follows we shall abbreviate Y\ an d by ]"[*). The 

elements of P are the treatments (x\,X 2 ,..., x v ), and P ⑹ has the following 
properties: 


1. The first component of any element is € for € = 1, 2,..., ?i. 

2. The last component of any element in the sth column is 5 (1 < 5 < t v ). 

3. The remaining components of any element are changed from one row to 
the next by first changing the (v — l)th component from 1 through ， v _i, 
then the (v — 2 )th component from 1 through t v _ 2 , and so on. 
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For example, for v = 

: 3 the submatrix P ⑹ 

is of the form 



(£,h2) 

… (i, IJ 3 ) 


(£, 2, 1) 

(£, 2, 2) 

… (£, 2, 


_(£, t 2 , 1) 

(£, t 2 , 2) 

... (£, t2, t^) 


(5.24) 


for i = 1, 2 ,... ， 

The next step is to define a matrix derived from P, 


分 2 

Q(P)= (5.25) 


LU 


of order t\ x t v such that is a row vector from (i = 1 , 2 ,..., t\) and 
any two vectors, q ⑹ and do not have the same row number in P ⑹ and 
P^'\ respectively. Denote the possible choices of Q(P) by Qi ， Q 2 ,..., 
(suppressing P), where 


n = n*r ; (；l) 


Example 5.9 Let t = 24, t\ = 2, t 2 = 3, = 4. The P matrices as given by 

(5.24) are 


With N = 


p' 1 '= 

"(1,1,1) 

(1,1,2) 

(1,1,3) 

(1,1,4)' 

(1,2, 1) 

(1,2,2) 

(1,2, 3) 

(1,2,4) 


_(1’3, 1) 

(1,3,2) 

(1,3,3) 

(1 ， 3,4)_ 

p< 2 » = 

"(2, 1, 1) 

(2,1,2) 

(2,1,3) 

(2,1,4)' 

(2, 2, 1) 

(2, 2’ 2) 

(2, 2, 3) 

(2, 2,4) 


(2,3, 1) 

(2, 3, 2) 

(2, 3, 3) 

(2,3,4)_ 

= 6 we then have 




Si = 

"(1,1,1) 

(1,1,2) 

(1,1,3) 

(l, 1,4)" 

_(2, 2, 1) 

(2, 2, 2) 

(2, 2, 3) 

(2, 2, 4)_ 

62 = 

"(1,1,1) 

(1,1 ， 2) 

(1,1,3) 

(l, 1,4)" 

J2, 3, 1) 

(2, 3, 2) 

(2, 3, 3) 

(2,3,4)_ 

63 = 

"(1,2,1) 

(1,2, 2) 

(1,2, 3) 

(1,2,4)" 

(2, 1’ 1) 

(2,1,2) 

(2,1,3) 

(2, 1,4) 
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Qa = 

"(1,2, 1) 

(1,2, 2) 

(1,2, 3) 

(1,2,4)' 

(2,3,1) 

(2, 3, 2) 

(2, 3, 3) 

(2,3,4)_ 

05 = 

"(1,3, 1) 

(1,3,2) 

(1,3,3) 

(1,3,4)' 

.(2, 1,1) 

(2,1,2) 

(2,1,3) 

(2 ， 1 ， 4)_ 

06 = 

"(1,3,1) 

(1,3,2) 

(1,3,3) 

(1,3,4)' 

(2, 2,1) 

(2, 2’ 2) 

(2, 2, 3) 

P ， 2,4) 


The PBIB design is now obtained by forming, for each Qi(l = 1,3,..., N), 
arrays 

= {btjuhj2, … ， (5.26) 

such that 


1. biji is an element of the ith row of Q^. 

2. For / # /’，biji and are selected from different columns of Q^. 

Thus for any given i, there are ty such Bij. The total number of arrays then is 

b = Nx x ti h) = fj 4 h) (5.27) 

i=2 

Each array (5.26) represents a block of size t\ and (5.27) gives the total number 
of blocks. Hence the collection of all Bij's is the desired EGD/(2 V — 1)-PBIB 
design. 

From we obtain the following 12 B\j\ 


及 11 

(1,1,1) 

(2, 2, 2) 

忍 14 : 

(1,1,2) 

(2, 2, 1) 

B\2 

(1,1,1) 

(2, 2, 3) 

忍 15 : 

(1,1,2) 

(2, 2, 1) 

Bn 

(1,1,1) 

(2, 2, 4) 

忍 16 : 

(1,1,2) 

(2, 2, 4) 

忍 17 

(1,1,3) 

(2, 2, 1) 

忍 10 : 

(1,4, 2) 

(2, 2, 1) 

忍 18 

(1,1,3) 

(2, 2, 2) 

^11 ： 

(1,1,4) 

(2, 2, 2) 

B\9 

(1,1,3) 

(2, 2, 4) 

忍 12 : 

(1,1,4) 

(2, 2, 3) 


The other blocks can be constructed in a similar way. □ 

It is, of course, obvious from the design above that only treatments that differ 
in all three components appear together in the same block. In general, this is 
assured by the definition of the P matrices in (5.24), the Q matrices in (5.25) 
and the B arrays in (5.26). Hence, only 入 （ 1, > 0. 
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Finally, we point out that if t\ =2 (as in Example 5.9) the method just 
described yields the same design as Aggarwal’s (1974) method described in 
Section 5.5.2. 

5.5.4 Generalization of the Direct Method 

As stated in Theorem 5.7 the direct method leads to EGD-PBIB designs with 
block size k = t\. Following Chang and Hinkelmann (1987) we shall now show 
that the theorem can be modified and used to construct certain EGD-PBIB designs 
with blocks of size k = ti where 2 < i < v. 

Suppose we have t = t\ x h x ... x 心 treatments with t\ < t 2 < • • • < t v . Let 
A = [i\, i 2 ,..., } be a subset of 1, 2,..., y with 2 < n < v. For ease of nota¬ 

tion we take A = {1, 2 ,, n], but the reader should have no difficulty replacing 
in actual applications 1, 2,..., n by iu h, … ， in. The main result can then be 
stated as in the following theorem. 


Theorem 5.8 Consider an experiment with t =「[)^=1 G treatments with t\ < 
t2 S ''' S tv- Let A = {1, 2,, n}. Then there exists an EGD / (2 V ~ 1 )-PBIB 
design with parameters 

fl o) 

i=l \i=2 / \j=n+\ ) 

n 

r= n (?i _ i)( ti_i ) k=h 

i=2 

n 

A(i,i,...,i,o,...,o) = na- 2 ) ( ^- 2) 

—— J 

n l —l 

all other X(y) = 0 

Before we prove the theorem we introduce the notation in the following defi¬ 
nition. 


Definition 5.4 Let a r t = (1,2,..., ti) be 1 x ti vectors for / = 1, 2,..., y. 
The symbolic direct multiplication (SDM) of a p and a q (l < p, q < v), denoted 
by a p O a q is given by an array of t q vectors: 


Clp (J) Clq == 





/d. 2 )\ 


((1 5 ^q)\ 



( 2 , 1 ) 

•> 

( 2 , 2 ) 

5 * • • ? 




\(t p , l)J 


\(t p ,2)j 
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where each vector has t p elements and the second components in a given vector 
are identical and equal to 1, 2 ,respectively. □ 

Proof of Theorem 5 .8 We first construct an EGD/(2 W — 1)-PBIB design for 
t* = YYi=\ ^ treatments according to Theorem 5.7. Let the parameters of this 
design be Z?*, r*, k*, X* say, as given in Theorem 5.7. Let B，(j = 1,2,..., b*) 
be a block in the EGD/(2 n — 1)-PBIB design. Further, let d\ = (1,2,..., ti){i = 
1,2,..., y), and let A = {1, 2,..., y} — A = {n + 1, n + 2,..., y} (because we 
have chosen A = {1, 2,, n}). We then define the SDP 

D-^ = d n ^-\ 0 dn^-2 ⑭…⑭ dv 

and consider the SDM 

Bj = b*qDj 

for j = 1 , 2 ,..., b*. Each consists of YVj=n+i O' vectors, each vector having 
t\ elements, each element being an v-tuple and representing a treatment. We thus 
have 

j=n-\-l \i=2 / yj=n+l 

vectors and each such vector is taken as a block for the EGD/(2 V — 1)-PBIB 
design. From the construction it is obvious that any two elements in the same 
block (vector) of B j are different in the first n positions and identical in the 
last (v — n) positions. Hence 入 （ 1,, 1 ， 0, ... ， 0 )= 入 *(1 ， … ， 1) and all other 
X(y) are zero. The other properties of the EGD/(2 V — 1)-PBIB design follow 
easily. □ 

We shall illustrate this procedure to construct a design with blocks of size 3 
for t = 24 treatments. 

Example 5JO Let t = 24, t\ = 2, = 3, = 4. To construct an EGD-PBIB 
design with blocks of size k = 3, we take A = {2, 3} and construct an EGD/3- 
PBIB design for ?* = 12 = 3 x 4 according to Theorem 5.7. For this case we 
define nU(for v = 2) to be 1, so that the matrix P of (5.23) becomes 

"(1,1) (1,2) (1,3) (1,4)' 

P= (2, 1) (2,2) (2,3) (2,4) 

_(3 ， 1) (3,2) (3,3) (3,4)_ 

and Q(P) — P. Then the B* are of the form 

"(1, Dl [(1,1)1 [(1,1)1 [(1,1)1 [(1,1)1 [(1,1)" 

(2, 2) (2,2) (2,3) (2,3) (2,4) (2,4) 

_(3,3 )」 L ( 3 , 4 )」 L ( 3 , 2 )」 L ( 3 , 4 )」 L ( 3 , 2 )」 L( 3 , 3 )_ 
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plus 18 more blocks. Now A = {1} and hence 



Then, for any B* above we obtain 

that is, the B j 9 s are of the form 

( 1 , 1 , 1 ) ( 1 , 1 , 2 ) ( 1 , 1 , 1 ) ( 1 , 1 , 2 ) 
( 2 , 2 , 1 ) ( 2 , 2 , 2 ) ( 2 , 2 , 1 ) ( 2 , 2 , 2 ) 
(3,3, 1) (3,3,2) (3,4,1) (3,4,2) 


and we have a total of 48 blocks. All we need to do now to obtain the final result 
is to interchange the elements in each triplet above so that they are in the “right” 
order, that is, 


(1,1,1) 

(2, 1, 1) 

(1,1,1) 

(2,1,2) 

(1,2, 2) 

(2, 2, 2) 

(1,2,2) 

(2, 2, 2) … 

(1,3,3) 

(2, 3’ 3) 

(1,3,4) 

(2, 3, 4) 

The parameters for this 

design 

then are 

t = 24, b = 48, k = 3, r = 6, and 


入 (0, 1， 1) = 1. □ 

We conclude this section to point out that the EGD-PBIB designs thus con¬ 
structed are disconnected except when A = {1, 2,..., v} which, of course, is 
the design of Theorem 5.7. Though disconnectedness is not desirable in gen¬ 
eral, it can be useful in connection with asymmetrical factorial experiments (see 
Chapter 12). 


5.6 HYPERCUBIC PBIB DESIGNS 

Using the result due to Kageyama (1972) that an EGD/(2 y — 1)-PBIB design 
constructed as a y-fold Kronecker product of BIB designs reduces to a hypercubic 
PBIB design with v associate classes if and only if the v BIB designs have the 
same number of treatments and the same block size, we can state the following 
corollary to Theorem 5.5. 

Corollary 5.1 Let there be v BIB designs D[ with parameters 
々，厂 / ，入 (/) and incidence matrix N[ such that ti = q and ki = k*(i = 
1,2,..., v). Then the Kronecker product design D given by its incidence matrix 


iV = iVl X #2 X … X iVy 
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is a hypercubic PBIB(y) design with parameters t = q v , b = nr=i bi，k = 
(k*) v , r = nj=i r “ an d 入 y = n/=i 。 nj=y+i 入 0) for = 1 ， 2, ... ， v. 

Proof We refer simply to the definition of the hypercubic association scheme 
(Section 4.6.10). Concerning the expression for 入 y, we note that because the ~’s 
and Iq’s are identical it follows that r/ / 入 (/) = constant for each i and hence all the 
入 (Ki, K2,... ， Kv) of the EDG-PBIB with the same number of unity components 
are identical. □ 


Similarly, for a second method of constructing hypercubic PBIB designs, we 
consider a special case of Theorem 5.6 and state the following corollary. 

Corollary 5.2 (Aggarwal, 1974) A hypercubic PBIB design with the param¬ 
eters t = q v , b = q v ~ l (q — l) v_1 , k = q,r = (q — l) v_1 , Ai = ^2 = = 

入 v _i=0, A v = l can be constructed if ^ is a prime or a prime power. 

For the proof we refer to the proof of Theorem 5.6 and the definition of the 
hypercubic association scheme (see Section 4.6.10). 

For another method of constructing hypercubic designs, see Chang (1989). 



CHAPTER 6 


More Block Designs and 
Blocking Structures 


6.1 INTRODUCTION 

In Chapters 3 and 5 we described the construction of two rather large classes 
of incomplete block designs with desirable statistical and mathematical proper¬ 
ties. Yet, these designs do not cover the need for incomplete block designs for 
many practical applications, such as variety, environmental, medical, sensory, and 
screening trials. It is impossible to create a catalogue with all possible designs 
because of the large number of combinations of design parameters. Hence there 
is a need for simple methods of constructing designs. Of particular interest often 
are resolvable designs (see Section 2.7.2) as they allow a certain amount of flex¬ 
ibility with respect to spatial and/or sequential experimentation. In this chapter 
we shall present three quite general algorithms to construct resolvable incomplete 
block designs allowing equal or unequal block sizes. 

Although in general comparisons among all treatments are of primary interest, 
there are situations where this is only of secondary importance. In such situations 
the forms of the experiment is on comparing so-called test treatments versus a 
control (the word control is used here in a general sense with different mean¬ 
ing in different areas of practical applications). Any of the designs discussed in 
previous chapters can be used for that purpose by simply declaring one of the 
t treatments as the control. Because of the special nature of the desired infer¬ 
ence, however, special designs have been developed, and we shall discuss them 
briefly here. 

So far we have only considered designs with one blocking factor. In most 
practical situations this is clearly sufficient to develop an efficient experimental 
protocol. There are, however, situations where it is useful to take into account 
additional blocking factors to further reduce experimental error. We shall consider 
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here briefly one additional blocking factor, one that leads us to the notion 
of row-column designs as generalization of the Latin-square-type designs of 
Chapter 1.10. 


6.2 ALPHA DESIGNS 


These designs, called a-designs, were introduced by Patterson and Williams 
(1976) and further developed by John and Williams (1995) to be used mainly in 
the setting of variety trials in agronomy. 

6.2.1 Construction Method 

The general ideas and steps of constructing an a-design are as follows: 

1. Denote the t treatments by 0, 1,1. 

2. Let t = ks, where k is the desired block size and ^ is the number of blocks 
in each replicate, there being r replicates in the final design. 

3. Select as the generating array a A: x r array a with elements a(p, q) in the 
set of residues mod s (p = 0, 1,2,..., k — l; q = 1,2,... ,r). 

4. Develop each column of a cyclically mod s to generate an intermediate 
k x rs array, a*. 

5. Add i • s to each element in the (i + l)th row of a*(i = 1, 2,..., A ： — 1) to 
obtain the final array, a**, containing elements 0, 1, 2,— 1. 

6 . Separate a** into r sets, where the jth set is formed by the columns, 
(j — l)s + 1, (j. — 1)5 + 2,, (j — l)s + s(j = 1,2,, r), each column 
representing a block. 

Example 6.1 We shall illustrate this construction method by adapting an 
example from John and Williams (1995) for t = 12 = 4x3; that is, k = 4, s = 3. 
In Table 6.1 we give the generating array a, for r = 2, the intermediate array 
a*, the final array a**, and the final design. 

The generating array a in Table 6.1 is called a reduced array, having zeros in 
the first row and first column. All arrays can be represented in the reduced form 
by adding suitable elements to individual rows and columns, always reducing 
mod s. This is convenient for the search of optimal a-designs; that is, designs 
with the highest efficiency factor E [see (1.108)]. For more details see Williams 
and John (2000). 

The efficiency factor itself is closely tied to the concurrence matrix NN r . It 
is easy to verify that NN’ for the design in Table 6.1 has off-diagonal elements 
equal to 0,1,2， implying that some treatment pairs never occur together in a 
block, other pairs occur together once, and still others occur together twice. 
To indicate this property the design will be called a a(0, 1, 2)-design. Just as, 
in general, 2-associate class PBIB designs are more efficient than 3-associate 
class PBIB designs for the same parameters r), so are 2-occurrence class 
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a Array 






0 

0 








0 

1 








0 

2 








0 

0 










Of* 

Array 






0 

1 

2 

0 

1 

2 




0 

1 

2 

1 

2 

0 




0 

1 

2 

2 

0 

1 




0 

1 

2 

0 

1 

2 






Of** 

Array 






0 

1 

2 

0 

1 

2 




3 

4 

5 

4 

5 

3 




6 

7 

8 

8 

6 

7 




9 

10 

11 

9 

10 

11 






Final Design 





Replicate 



1 




2 


Block 


1 

2 

3 


1 

2 

3 



0 

1 

2 


0 

1 

2 



3 

4 

5 


4 

5 

3 



6 

7 

8 


8 

6 

7 



9 

10 

11 


9 

10 

11 


cy-designs preferred over 3-occurrence class a designs (Patterson and Williams, 
1976). For example, the generating array 

0 0 
0 
0 
0 


2 
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leads to an a(0, 1)-design. The efficiency factor for this design can be computed 
to be .71, which, in this case, is about the same as that for the a(0, 1, 2)-design. 
For purposes of comparison we mention that the upper bound (1.110) for the 
efficiency factor is .82. □ 

6.2.2 Available Software 

Williams and Talbot (1993) have provided a design generation software package, 
called ALPHA+, which produces a-designs in the parameter ranges 


2 < r < 10 2<k <20 t < 500 


A more general software package, called CyDesigN, has been provided by 
Whitaker, Williams, and John (1997). Both packages are able to construct designs 
with high efficiency factors relative to the upper bounds for such design thus 
leading to near-optimal designs. 


6.2.3 Alpha Designs with Unequal Block Sizes 

The construction of a-designs is quite flexible and able to accommodate most 
any desirable, practical block size for a given t, or to deal with the situation 
when t cannot be written as ^ = ks(s > 1). In either case we may write 

t = S\k\ + S2k2 

and construct a design with s\ blocks of size k\ and S 2 blocks of size k 2 . Of 
particular interest is the case where k 2 = k\ — l, that is, where the block sizes 
are not too different, differing only by one unit. For this situation an appropriate 
design can be derived from an a-design as follows: 

1. Construct an a-design for t S 2 treatments in 5 = 5 1 2 ! + ^2 blocks of size k\ 
in each replicate. 

2. Delete from this design the treatments labeled — 1. 

Example 6.2 Consider 

^ = 11=2x4+1x3 

that is, s\ = 2, S 2 = 1, k\ = 4, = 3. The a-design in Table 6.1 is a design for 

f +^2 = 11+1 = 12 treatments in blocks of size 4. Deleting treatment 11 yields 
the derived design with two blocks of size 4 and one block of size 3. □ 
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6.3 GENERALIZED CYCLIC INCOMPLETE BLOCK DESIGNS 

We have seen in Section 5.3 that cyclic PBIB designs are easy to construct and 
encompass a large class of useful designs. Jarrett and Hall (1978) extended the 
idea underlying these designs and introduced the notion of generalized cyclic 
incomplete block designs. Basically, for t = m x n, the designs are obtained by 
cyclic development of one or more initial blocks, but rather than adding 1 to the 
element in the initial blocks(s) we now add successively m to each element and 
reduce mod t. 

Example 6.3 Consider f = 12 = 3x4; that is, m = 3, n = 4. Let the initial 
blocks be 


(0,2,3,7)3 and (1,4,5,9) 3 


where the subscript indicates the incrementing number. Cyclic development then 
leads to the following design with b = S, k = 4: 


(0, 2,3, 7) ( 1, 4, 5,9) 

(3, 5,6,10) ( 4, 7 ， 8, 0) 

(6, 8,9, 1) ( 7, 10,11,3) 

(9,11,0, 4) (10, 1, 2,6) 


Inspection shows that = r 2 = = r\\ = 3, r\ = = 

厂 7 = rio = 2. The reason for the two different number of replications can be 
explained as follows. The treatments can be divided into m = 3 groups ofn = 4 
elements by considering the residue classes mod m, that is, 

Si = {/, / + m, ..., / + m(n — 1)} 

for /= 0, 1, …， m — 1. For our example we have 

So = {0, 3, 6, 9} 

A = {1,4,7, 10} 

& = P ， 5,8 ， 11} 

It follows that when an initial block is developed the elements in each residue 
class are equally replicated as each initial block contributes n blocks to the design. 
Now, the two initial blocks contain three elements each from and and two 
elements from S 2 . 

From this argument it follows immediately that in order to construct an 
equireplicate design we need three initial blocks such that each residual class 
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is represented the same number of times in these three blocks. One possible set 
of initial blocks is 

(0, 2, 3, 7) 3 (1,4, 5, 9) 3 (6, 8, 10, 11) 3 

It is easy to select initial blocks to construct an incomplete block design with 
desired parameters. There is, however, no guarantee that this would lead to a 
design with a high efficiency factor. One would need a search algorithm to find 
the “best” design. In general, though, aiming for nearly equal numbers of concur¬ 
rences of all pairs of treatments will yield a design with a high efficiency factor. 

Cyclic development can also be used to construct incomplete block designs 
with unequal block sizes by specifying initial blocks of different sizes. □ 


6.4 DESIGNS BASED ON THE SUCCESSIVE 
DIAGONALIZING METHOD 


Khare and Federer (1981) have presented several methods of constructing resolv¬ 
able incomplete block designs for various factorizations of t, the number of 
treatments. In Section 3.3.4 we have discussed their method when t = K 2 and 
[is a prime or prime power. Here we shall discuss a few other cases. 

6.4.1 Designs for t = Kk 

For K prime or prime power and k < K, where k is the block size, we can 
use the method described in Section 3.3.4 for t = K 2 . We then consider only 
replicates 2, 3,+ 1 and delete in each replicate the treatments numbered 
Kk + 1,..., K 2 . This leads to a binary design with r = K replications and rK 
blocks of size k. These designs are also called rectangular lattice designs (see 
Section 18.11). ^ ^ 、 


Example 6.4 For ? = 12 = 4 x 3 we use the design given in Example 3.8 
and delete treatments 13, 14, 15, 16 from replicates 2, 3, 4, and 5. □ 

Example 6.5 For ? = 8 = 4 x 2 we can again use the design from 
Example 3.8 and delete treatments 9, 10,, 16 from replicates 2, 3, 4, and 
5 to obtain a design with b = 16, k = 2, and r = 4. □ 

6.4.2 Designs with t = n 2 

We consider here the situation where Ms a square but not a square of a prime 
number or prime power. The n can be written as n = mp s , where p s is the small¬ 
est prime in n. The first + 1 replicates from the w + 1 replicates obtained by 
the successive diagonalizing method on n 2 treatments in blocks of size n yield an 
incomplete block design with (0, l)-concurrences for treatment pairs. For addi¬ 
tional replicates one can either repeat some of the original 朽 + 1 replicates or 
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develop further replicates continuing with the successive diagonalizing method. 
In the latter case the numbers of concurrences increase, however, in groups of p s . 

The above method can also be used to generate a design for t = nk{k < n). We 
first obtain the design for t = n 2 and then delete from replicates 2,... ， + 1 the 
treatments numbered nk + l,... ,n 2 . The versatility of this method is illustrated 
by the following example. 


Example 6.6 Fichtner (2000) considered the case of ^ = 400 varieties (treat¬ 
ments) in blocks of size k = \0 with r = 2 replicates for an agronomic trial. In 
the context discussed above, we have ? = 40 x 10. Rather than writing out the 
40 x 40 array for replicate 2 (of the generating design) one only needs to write 
out the 40 x 10 array. Thus the first block would contain the varieties 

(1 ， 41 ， 81 ， 121, 161, 201, 241, 281, 321, 361) 

Adding 1, 2, ...，39 to each element produces the remaining 39 blocks. From 
this it is easy to see that the first block in replicate 2 contains the varieties 

(1,42, 83, 124, 165, 206, 247, 288, 329, 370) 

The remaining blocks can then be filled in easily using replicate 1. We thus obtain 
the following replicate: 


Block Treatments 


1 

1 

42 

83 

124 

165 

206 

247 

288 

329 

370 

2 

2 

43 

84 

125 

166 

207 

248 

289 

330 

371 

3 

3 

44 

85 

126 

167 

208 

249 

290 

331 

372 


39 

39 

80 

81 

122 

163 

204 

245 

286 

327 

368 

40 

40 

41 

82 

123 

164 

205 

246 

287 

328 

369 


If a third replicate were desired, the method of successive diagonalizing would 
yield the initial block as 

(1,43,85, 127,169, 211 ， 253, 295, 337, 379) 

The remaining blocks are obtained by columnwise cyclic substitution (as descri¬ 
bed in Section 3.3.4) using the replicate given above. □ 

For other cases of constructing (0, 1)-concurrence resolvable incomplete block 
designs, we refer the reader to Khare and Federer (1981)，where methods are 
discussed for t = p 2k , t = n^, t = n 2k . 

6.5 COMPARING TREATMENTS WITH A CONTROL 

In some experimental situations one of the treatments, often referred to as the 
control or standard ，may play a major role in that the focus is not so much the 
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comparisons among all treatments but rather the comparisons of the remaining 
treatments, often referred to as the test treatments, with the control. The function 
of the controls might simply be to provide an indication whether the treatments are 
even worth consideration or to show their superiority to established procedures, 
such as comparing new drugs against an established medication, or new varieties 
against an old variety. 

Obviously, any of the designs we have discussed so far would be suitable if 
we simply declare one of the t treatments as the control. However, because of the 
limitation on the inference space the designs may not be a good choice and more 
specialized designs may be called for that are more efficient for this particular 
purpose. Some of these designs will be described in the following sections. 

6.5.1 Supplemented Balance 

Suppose we have t test treatments and one control treatment. Let us denote 
the f + 1 treatments by 7o, Ti, 7*2,..., T t , where To represents the control. In 
the context of block designs, a natural way to construct suitable designs for 
comparing T 2 ” . ” T t with 7o is to consider a block design for T\, T 2 ,..., T t 
and augment or reinforce each block with q(> 1) replications of Tq. This idea 
was suggested by Cox (1958) using a BIB design as the generating block design. 
Das (1958) referred to such designs as reinforced BIB designs. 

A more general class of designs, referred to as supplemented balance designs, 
was introduced by Hoblyn, Pearce, and Freeman (1954) and more formally by 
Pearce (I960). These designs are characterized by the following form of the 
concurrence matrix: 



/so 

人 0 . 

.• 入 0 、 

NN'= 

入 0 

s . 

Al : 


〈入 0 

入 1 • 

s) 


( 6 . 1 ) 


implying that the self-concurrence is the same for all the test treatments. Further, 
Tq occurs together 入 0 times with each of T\, T 2 , ... ， T t , and 7} and Tj(i ^ j, 
j z/z 0) occur together 入 1 times. 

The example below of a supplemented balance design was given by Pearce 
(1953) for t = 4, b = 4, k = 7, tq = S, r = 5, Xq = 10, = 6: 


0 

* U 

0 

1 

2 

^ 5 

3 

八 U 

4 

1 

0 

0 

1 

2 

3 

4 

2 

0 

0 

1 

2 

3 

4 

3 

0 

0 

1 

2 

3 

4 

4 


where each row represents a block. We note here in passing that the design 
above is an example of what we have called an extended block design (see 
Section 1.9.8.5). And for the reinforced BIB design we have 入 0 = qr. 
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For incomplete blocks an obvious way to construct a supplemented balance 
design is to augment each block in a BIB design with t treatments by one or more 
replications of the control. Under certain conditions such designs are A-optimal 
(see Hedayat, Jacroux, and Majumdar, 1988; Stufken, 1987). 


6.5.2 Efficiencies and Optimality Criteria 

Clearly, there exist many ways to construct suitable augmented designs for given 
parameters t ， b, and k. It is, therefore, important to distinguish between competing 
designs and choose the one that is “best” for the purpose at hand, which may 
not only depend on statistical but also on practical considerations. 

From the statistical, that is, inferential, point of view we would like to obtain 
maximum information with respect to comparing To versus 7} (/ = 1 , 2 ,..., t). 
To formulate this more precisely, we consider the usual model for observations 
from a block design, that is, 


yij = M + 爲 • + ^ij (6.2) 

or, in matrix notation 

y = + XpP + X T xj ^ (6.3) 


The information matrix [see (1.9)]，assuming equal block sizes, is given by 


C = R - -NN f 
k 


(6.4) 


where R = X r T X x = diag(ro, r\,... ,r t ) and N = X f T X^. We know [see (1.20)] 
that for an estimable function c f r of the treatment effects 


var(c/r) = c’C 一 c 


For the designs discussed in this section the estimable functions of primary 
interest are of the form zj — To(j = 1,2,t). If we denote by 


/-I 1 0 .. 

-10 10 
-10 0 1 


0 \ 

0 

0 =(-13，/,) (6.5) 


\-l 0 0 . 0 1/ 


the coefficient matrix for the above contrasts, then the variance - covariance 
matrix for the estimators for those contrasts is given by PC~ 
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If we write C of (6.4) as 

r-( c ° c i ) 

Cn) 

then a generalized inverse of C is given by 

/0 ... 0、 

C~ = : 

\0 C7/ ) 

which is equivalent to solving the reduced NE Cr = Q [see (1.8)] by imposing 
the condition fo = 0 (recall that SAS PROC GLM, e.g., obtains a generalized 
inverse equivalently, by assuming x t =0 rather than fo = 0). Using C~ of the 
form (6.7) implies that 

PCP' = c^ 1 

and further that (1) the information matrix for treatment control contrasts is given 
by Cn of (6.6) [see also Majumdar (1996)] and (2) with C^ 1 = {c 11 ) 


( 6 . 6 ) 


(6.7) 


av. var (ij — tq) 




E 

)=1 



( 6 . 8 ) 


where dj(j = 1,2,... ,t) are the eigenvalues of Cn [see (1.106) and Constantine 
(1983)]. i 

For a CRD with rj (j = 0, 1,2,..., t) replications for the jth treatment we 
have, for j ^ 0, 


where 


av. var (ij — fo)= 



^2 

〜 (CRD) 



(6.9) 


The ratio of (6.9) over (6.8) 


(assuming = 



can then be defined 


as the efficiency factor E for a treatment control comparison design (see also 
Section 1.12.2), that is, 


1 / 厂 0 + V〒h 
1 

(i/o (w 


( 6 . 10 ) 


can be used to compare competing designs. 
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Another way to compare designs is to make use of (6.8) and find, for a given 
set of parameters t ， b ， k ， the design Z)*, say, with minimum value of 



where D* is referred to as an A-optimal design (see Hedayat, Jacroux, and 
Majumdar, 1988, and Section 1.13). 

An alternative criterion, called MV-optimality, is to minimize 

max (l/dj) 

i<j<t 

for competing designs (see Hedayat, Jacroux and, Majumdar, 1988). 

A different approach to optimal designs is suggested and explored by Bech- 
hofer and Tamhane (1981 ， 1985). They propose to maximize the confidence 
coefficient for simultaneous confidence intervals for all Xj — ro(j = 1,2,, t), 
thereby linking the designs to Dunnett’s procedures (see 1.7.5.7) for comparing 
treatments with a control (Dunnett, 1955, 1964). In addition to assuming model 
(6.2), this method has to make distributional assumptions about the 〜 ’s, which 
for practical purposes amounts to assuming normality (see Section 6.5.5). 

6.5.3 Balanced Treatment Incomplete Block Designs 

An important and useful class of treatment control designs was introduced by 
Bechhofer and Tamhane (1981), a class they refer to as balanced treatment incom¬ 
plete block designs (BTIBD). 

6.53.1 Definition and Properties 

Definition 6.1 A treatment control design with t test treatments and one 
control treatment in b blocks of size k < t \ is called a balanced treatment 
incomplete block design, denoted by BTIBD(^, b, k\ 入 o , 入 1 )，if (1) each test treat¬ 
ment occurs together with the control 入 o times in a block and (2) any two test 
treatments occur together 入 i times in a block. □ 

The concurrence matrix of a BTIBD is then of the form 



(SO 

入 0 … 

… 入 0 、 



入 0 

^1 入 1 

... 入 1 


NN'= 

入 0 

Ai ^2 

. •.入 1 

(6.11) 


、入 0 

入 1 ... 

>•1 S t ) 
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Comparing (6.11) and (6.1) shows that some designs with supplemented balance 
are special cases of BTIBDs. 

Since the BTIBDs are, by definition, connected designs, it follows that 
rank(C) = t, where C is given in (6.4). It follows then from (6.4), with N = 
(riji), and (6.11) that 


and 


^0 




tX 


n 


Z' 


ji 


入 o 

T 


( 卜 l) 入 i 

~ k 


(y = 1, 2,, t) 


( 6 . 12 ) 


It follows from (6.12) that 

r j ~ ~j^ 4 = [ 入 0 ~ — l)A ， i ]/k 


and hence Cn of (6.6) has the form 

Cn =al + b00 f 


with 


A，o + tX\ 
k 


b 


- 入 1 

~k~ 


(6.13) 


(6.14) 


Using (6.13) and (6.14) it is then easy to obtain C^ 1 , which is of the same form 
as Cn, say 


c# =cl + d00 f 


We find 


k 


and d 


kX\ 


Xo tX\ 人 0( 入 o + f 入 l) 

and hence from our discussion above this implies that 

众 ( 入 0 + 入 1) 2 


var(f 7 - - f 0 ) 


入 0( 入 0 + 广入 1) 


(j = 1,2, ... ,t) 


(6.15) 


(6.16) 


(6.17) 


and 


CO\(Tj - fo, ry - fo) 


kX\ 


入 o( 入 o + f 入 1) 


(j ^ /) (6.18) 
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Expressions (6.17) and (6.18) explain, of course, in what sense these designs are 
balanced. 

6.53.2 Construction Methods 

There exist different methods of constructing BTIB designs, and we shall mention 
a few without going into all the details. 

One obvious method, mentioned already in Section 6.5.1, is to augment each 
block of a BIBD( ，， b, k, r, A ， i) with n copies of the control. This is a special case 
of a more general class of BTIB designs, which we denote (following Hedayat, 
Jacroux, and Majumdar, 1988) by BTIBD( ，， 办，众 *; w, 入 o, 入 l) and which is char¬ 
acterized by the following form of the incidence matrix: 

riji = 0 or 1 for j = 1,2,... ,t i = 1, 2,..., Z? 

and 

行 oi = ^02 = ... = = w + 1 

«o,5+i = ^ 0 , 5+2 = • - = nob = u 

The special case mentioned above has ^ = 0 and hence k* = k u. Such a design 
is of rectangular or 沢 -type，and the general form is depicted in Figure 6.1，where 
D\ represents the BIBD(r, b, k, r, A ， i). 

For ^ > 0 the design is said to be of step or 5-type. Its general form is shown 
in Figure 6.1. 

One method to construct an 5-type BTIB(? ， Z? ， A ： * ， 入 o, 入 i) is to choose BIB 
designs for D: and D 3 , more specifically 

BIBDO, s,k — l, r( 2 ) ； 入⑵） 

and 

D 3 = BIBDO, b - s,k ， r ⑶; 入⑶） 

so that k* = k -^r u, Xq = (w + l)r( 2 ) + wr( 3 )，and = 人 ( 2 ) + 入 ( 3 ). 

For this particular method of construction we make the following comments: 

1. Let us denote by and the two component designs of the 5-type 
BTIB design (see Fig. 6.1). Then and with D 2 and D 3 as given in 
(6.19) and (6.20) ， respectively, are themselves BTIB designs. They constitute 
what Bechhofer and Tamhane (1981) have called generator designs (see 
Definition 6.2). 

2. As a special case of 5-type designs we can have w = 0; that is, the blocks 
in D 3 do not contain the control, which means that = D 3 . 

3. For u = 0 the design D 3 can be a RCBD，possibly with b — s = 1. 


(6.19) 


( 6 . 20 ) 
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EU 


Block 

1 2 … b 


1 


2 

Control 

u 


M + 1 

Di 

u-\-k 


⑻ 


1 2 … s 

s -\-l … b 


1 



1 

2 

Control 

Control 

2 

u 

M + 1 



u 



M + 1 



d 3 



d 2 



u -\- k 



u k 


D* 

(b) 

Figure 6.1 General form of (a) R- and (b) 5-type BTIB designs. 


4. As a trivial case (but only in the context of BTIB designs) the design D 2 
may have blocks of size 1 with s = mt\ that is, each treatment occurs m 
times. An important case in this context is k* = 2, m = 1 with 

Z) 2 = {1,2, •..，，} (6.21) 

and 

^-{2 5 ... '7 1 } (622) 

Hedayat and Majumdar (1984) denote D 2 of (6.21) and D 3 of (6.22) by ^ ^ 1 
and ， [ 2 , respectively, where p in general denotes all (P distinct blocks 
of size p for t treatments. 
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Let us now return to the notion of generator designs (GD) and their function 
in constructing BTIB designs. Bechhofer and Tamhane (1981) give the following 
definition. 


Definition 6.2 For t test treatments and blocks of size k* a generator design 
is a BTIB design no proper subset of whose blocks forms a BTIB design, and 
no block of which contains only one of the t + 1 treatments. 

To illustrate this notion we use the following example. □ 


Example 6J For t = 3 and k* = 3 consider 


GDi 


0 0 0 
0 0 0 
1 2 3 


入 o = 2 


GD 2 


0 0 0 
1 1 2 
2 3 3 


入 o = 2 


GD 3 


入 o = 0 


gd 4 


0 0 0 1 
0 0 0 2 
12 3 3 


入 o = 2 


入 i = 0 


入 1 = 1 


入 l 


入1 = 1 


Of these four designs GDi, GD 2 , and GD 3 are generator designs, but GD 4 is 
not, because GD 4 = GDi |J GD 3 . □ 


The design GD 4 illustrates the construction of a BTIB design, D say, in terms 
of the generator designs: 

D = /iGDi 1J / 2 GD 2 1J / 3 GD 3 (6.23) 

with fi > 0(/ = 1, 2, 3) and at least f\ or /2 > 1 for an implementable BTIB 
design. We note that the design D in (6.23) with /；• > 1( / = 1, 2, 3) represents 
another form of 5-type BTIB design, which we might refer to as a 5^-type design 
because it has two steps. In general, we might have a 5^-type design 


q 

D^\Jfi GDi 

i=l 

with GD/ having bi blocks and b = fibi. Apart from the control the GD/ 
represent BIB/RCB designs with increasing block size k(l < k < k*). 
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Replacing 5, 6, and 7 by 0 leads to the BTIBD(4, 7, 4; 6, 2): 


3 1112 12 
0 4 2 2 3 3 4 
0 0 0 3 4 4 0 
0 0 0 0 0 0 0 


(6.26) 


Suppose we want to construct a BTIBD(4, 8, 4; 入 o, 入 l). We could adjoin to 
the GDi given by (6.26) the GD 2 =4^4 and obtain the following BTIBD(4, 
8 , 4; 6, 3): ^ 


3 1112 12 1 
04223342 
00034403 
00000004 


Replacing 6 and 7 by 0 leads to the BTIBD(5, 7, 4; 4, 2): 


and replacing 7 by 0 leads to the BTIBD(6, 7, 4; 2, 2): 


The above then represents a method of constructing generator designs. Another 
method is to start with a BIBD(r*, b, k, r; X) with t* > t and identify the treat¬ 
ments t l, t 2,t* with the control, that is, treatment 0 (Bechhofer and 
Tamhane, 1981). Any block with 0’s only would be deleted. We illustrate this 
method with the following example. 

Example 6.8 Starting with the BIBD(7, 7, 4, 4; 2) 


5 

2 

( 6 . 


2 4 5 0 

13 4 5 

2 3 4 0 
12 3 0 
12 5 0 

14 0 0 

3 5 0 0 


24 ) 



2 4 5 6 

13 4 5 

2 3 4 0 
12 3 6 
12 5 0 

14 6 0 

3 5 6 0 


2 4 5 6 

13 4 5 

2 3 4 7 
12 3 6 
12 5 7 

14 6 7 

3 5 6 7 


(6.27) 
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We should mention that instead of replacing 5 ，6， 7, by 0 we could have 
replaced, for example, 6, 7, and 2 by 0, yielding 


0 0 0 0 0 0 1 

0 0 0 0 0 0 3 

3 1113 4 4 

5 4 5 3 4 5 5 


Other methods of constructing generator designs are given by Bechhofer and 
Tamhane (1981). □ 


6.5.4 Partially Balanced Treatment Incomplete Block Designs 

To further enrich the class of treatment control designs, an obvious step is to 
generalize BTIB designs to PBTIB, or partially balanced treatment incomplete 
block designs, just as BIB designs were generalized to PBIB designs. Rashed 
(1984) introduced two types of PBTIB designs, and we shall give some of his 
results below. Another form of PBTIB design was defined by Jacroux (1987) 
(see also Stufken, 1991). 

6.5.4.1 Definitions and Structures 

Definition 6.3 (Rashed, 1984) A treatment control design for t test treat¬ 
ments and one control is said to be a PBTIB type I design if it satisfies the 
following conditions: 

1. The experimental units are divided into b blocks of size k{k < 0- 

2. Each test treatment occurs in r blocks, but at most once in a block, and the 
control is replicated ro times with possibly multiple applications in a block. 

3. The set of test treatments T = {1, 2, ...，，} is divided into two disjoint sub¬ 
sets, T\ and T 2 , with q and t — q treatments, respectively. Each test treatment 
occurs together in a block with the control 入 01 or 入 02 times, where 

b 

入 01 = ^rijinpi for j e T\ 
i=\ 

b 

入 02 = ^rijinpi for j e T 2 
i=l 

4. Any two treatments in Ti occur together in a block 入《 times (£ = 1, 2). 

5. Any two treatments, one from T\ and the other from T:, occur together in a 
block 入 12 times. 
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It follows then that the matrix C\\ of (6.6) is of the form 

(al q + b3 q 3， q e0 q 0^ \ 

C n = (6.28) 

\ e ^q^q + ^q^q) 

with q = t — q and a = r — r I k 七 k\\l k,b = —"k\\l k, c = r — r I k "kuj k, 
d = —^n/ k, and e = —A 12 / k. D 

Since C^ 1 is of the same form as C\\ of (6.28) we can state the following 
theorem. 


Theorem 6.1 For a PBTIB type I design the variance-covariance structure 
of the estimators r) — tq for Tj — to is given by 


var(r}- - ro)= 


CO\(tj - tq, ty - rb)= 


ot\o^ for j e T\ 

oi\o^ for j e T2 

PH 2 for j，i’ e T\ 

P 2 o] for j, i f e T 2 

P^e for / e T u j , e T 2 


where a\, p\, P 2 , Pz are functions of the design parameters (see Rashed, 

1984). ‘ 


An illustration of a PBTIB type I design is given in the following example. 


Example 6.9 For t = 5, b = S, k = 3, r = 3, ro = 9 the two groups are 
T\ = {2, 3,4}, T 2 = {1, 5} with 入 01 = 2, 入 02 = 3 , 入 11 = 1, 入 22 = 0, 入 12 = 1. The 
design is given by 

"0 0 0 0 0 0 1 2 " 

D = 0 0 0 1 3 4 2 3 

1 2 5 3 4 5 4 5 


and 


(.619 

.19 

.19 

.166 

.166\ 

.19 

.619 

.19 

.166 

.166 

.19 

.19 

.619 

.166 

.166 

.166 

.166 

.166 

.583 

•083 

(.166 

.166 

.166 

.083 

.583 〉 


that is, a\ = .619, a\ = .583, p\ = .19, P 2 = .083, and ^3 = .166. □ 
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Another type of PBTIB design that retains the feature of two variances but 
allows for up to six different covariances for comparison estimators is described 
in the following definition. 

Definition 6.4 (Rashed, 1984) A treatment control design for t test treat¬ 
ments and one control is said to be a PBTIB type II design if it satisfies the 
following conditions. Conditions 1, 2, and 3 are as in Definition 6.3: 

4. Each group Ti considered by itself has either a BIB or a PBIB association 
structure, with at least one of these groups having PBIB design properties; 
that is, any two treatments in 7/ occur together in either 入 ^ 1 ) or 入 g) blocks 
with kii ⑴士 for at least one / = 1, 2. 

5. Any two treatments, one from T\ and the other from 72, occur together in 

X \2 blocks with 入 i 2 = or 入斤 )（/ = 1, 2). □ 

We note that, if in condition 4 above both groups have BIB association struc¬ 
ture, then the PBTIB type II design reduces to a PBTIB type I design. This gives 
an indication how much more general type II designs are compared to type I 
designs. 

For type II designs Rashed (1984) proved the following theorem. 

Theorem 6.2 For a PBTIB type II design the variance - covariance structure 


of the estimators r) — ro for tj 

-r 0 is 

given by 




for 

j e ^ 


var(r}- - to) = • 







for 

j e T 2 



Pll〜 2 

or 

Pncr ； 

for j, / e Ti 

cov(r}- - to) = * 

P2icr e 2 

or 


for j, / e 7o 


P3icr e 2 

or 

P32〜 2 

for j eT u / e T 2 


We shall illustrate this structure with the following example. 

Example 6.10 For r = 6, Z? = 8, A: = 3, r = 3, r = 3, ro = 6, the two groups 

are Ti = {1, 2, 5, 6} and T 2 = {3, 4}, and the design is given by 

"1 2 3 0 0 0 0 0~ 

D = 2 3 4 4 5 1 0 1 

45656623 
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with 入 01 = 2 ，入 02 = 1, 入 (l? = 1, 入 ^) = 0 (for a PBIB association structure): 


0th Associate 

1st Associates 

2nd Associates 

1 

2, 6 

5 

2 

1, 5 

6 

5 

2, 6 

1 

6 

1， 5 

2 


and A ，22 = 1 (for a BIB association structure) and 入 以 =1. The vari¬ 
ance-covariance matrix (if the treatments are written in the order 1, 2 ， 5, 6, 
3, 4) is found to be 


A 66 

.22 

.16 

.22 

.25 

.25\ 

.22 

.66 

.22 

.16 

.25 

.25 

.16 

.22 

.66 

.22 

.25 

.25 

.22 

.16 

.22 

.66 

.25 

.25 

.25 

.25 

.25 

.25 

.71 

.29 

\.25 

.25 

.25 

.25 

.29 

•71/ 


that is, Qfj = .66, = .71, pii = .22, p \2 = .16, P 21 = .20, P 3 i = .25. 


□ 


6.5.4.2 Construction of PBTIB Designs 

We shall describe briefly one method of constructing PBTIB designs as developed 
by Rashed (1984). It is an extension of one of the methods described by Bechhofer 
and Tamhane (1981) to construct BTIB designs (see Section 6.5.3.2). Generally 
speaking, in order to construct a PBTIB design with t treatments in b blocks of 
size k(<t), we start with a PBIB design with t*(>t) treatments in b*(<b) blocks 
of size k* = k, and then replace the t* — t “excess” treatments by 0, deleting any 
resulting blocks which contain only the control. This is a very general recipe and 
may not necessarily lead to a PBTIB design unless we impose further conditions. 
Therefore, to be more precise, we start with a GD-PBIB design (see Section 4.6.1) 
and proceed as follows: 

1. Write out the GD association scheme for t* = t* x treatments, that is, an 
array of rows and columns, which we denote by T. 

2. For 


t = (t\\ x tn) + fei x t2l) 

form two nonoverlapping subarrays x t\ 2 ) and x t 2 i) within T 

such that T\ and T 2 each form a GD association scheme. 

3. Denote the remaining set of treatments by To and replace those treatments 
by 0 (these are the excess treatments mentioned above). 
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Schematically this procedure can be depicted as follows: 


hi 


K 


t2l 


q ^ 

This implies that 

^ 11+^21 < 
t\2 < 

til < 

Following steps 1-3 and returning to the starting PBIB design yields the 
required PBTIB design in which each treatment is replicated r times and the 
control is replicated ro times where 

ro = (t*t2 — t\\t\2 — t2\t2i)r = (t* — t)r 

Furthermore, the remaining parameters of the PBTIB design are related to those 
of the PBIB design as follows: 

入 L !)= = ^2 (/ = 1, 2) 

(C) exists only if 7/ has more than one treatment in the same row of T, and 

(2) 

W exists only if 7/ has treatments in more than one row of T), 

入 0/ = a/ 入 i + bik: (/ = 1, 2) 


tn 



where at equals the number of treatments that are replaced by zeros and are 1st 
associates of a treatment in T“ that is, a[ = ?| — t( 2 , and b[ equals the number 
of treatments that are replaced by zeros and are 2nd associates of a treatment in 
TI, th&t is, b( = t* _ t _ (?2 _ f/2) — _ t _ Q-i (i == 1, 2), and 

入 12 = 入2 

We shall illustrate the construction of a PBTIB design with the following 
examples. 
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T 2 3 6 1, 2, 4, 5 ， 7， 8 

6 3 1, 2 ， 4, 5, 7， 8 

It follows, therefore, that the design (6.29) is a PBTIB type II design with param¬ 
eters r = ro = 3, 人 oi = 1 ，人 02 = 0, 人 = 0, A,^ = 1, = 0, X \2 = 1. □ 

Example 6.12 Suppose t = 1, k = 3. We start again with design SR 23 as 
given in Example 6.11. We now replace 8 and 9 by 0, that is, 7o = {8, 9} and 
then have 

乃 ={1 4 7 } T 2 =l 2 3 I 


The association schemes for T\ and T 2 follow from the association scheme for 
T as 

0th Associate 1st Associates 2nd Associates 


If we replace 9 by 0, that is, To = {9}, then 

Tl = {l 5 8 乃 ={3 6 } 

and the PBTIB design is given by 


and association structure T : 


Example 6.11 Suppose t = S, k = 3. We start with design SR 23 from Clat- 
worthy (1973) with t* = 9, k = 3, r = 3, b* = 9, = 0, X 2 = 1: 


8 7 8 7 8 7 
6 ,心心 vcT vcT ^ 
5 4 5 4 5 4 
3, 3，3，3，3， 3, 
2 ' —- 2 . —- 2 . —_ 


7 8 7 8 4 5 
4, 5, 1，2, 1，2, 


12 4 5 7 8 


rl 


9) 

2 

( 6 . 



3 5 7 

2 4 0 
16 8 

3 4 8 
2 6 7 
15 0 
7 8 0 

4 5 6 
12 3 



7 8 9 
4 5 6 
12 3 



3 5 7 

2 4 9 
16 8 

3 4 8 
2 6 7 
15 9 
7 8 9 

4 5 6 
12 3 
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or, schematically, 

1 4 1 < — T\ 

— To 


T 2 

The resulting PBTIB design is 

1 4 7 1 2 3 1 2 3' 

250564645 (6.30) 

36007000 7_ 

with parameters ro = 6 , = 2 , 人 02 = 1 , 人 (j) = 0 , = 0 , = 1 , 人 12 = 1 . 

It is easy to verify that the design (6.30) is a PBTIB type II design. 

Since there exist many GD-PBIB designs (see Clatworthy, 1973) the method 
described above will yield a rather rich class of PBTIB designs, most of them 
being of type II. □ 

6.5.5 Optimal Designs 

In the preceding sections we have described the construction of various types of 
treatment control designs. For a prespecified value of the number of test treat¬ 
ments t and block size k, different methods may lead to different designs, some 
more efficient than others. In deciding which design to choose among competing 
designs and to use in a practical application, one consideration may be to com¬ 
pare their efficiencies and look, if possible, for the optimal design. In this context 
different optimality criteria have been used (see Section 6.5.2), based on different 
philosophies [for a discussion see Hedayat, Jacroux, and Majumdar (1988), and 
the comments following their article]. Unfortunately, different optimality criteria 
may lead to different optimal designs, but often the optimal design using one 
criterion is near optimal under another criterion. 

Using the A-optimality criterion Hedayat and Majumdar (1984) provide for the 
class of BTIB designs a catalogue of optimal R- and 5-type designs for block 
sizes 2 < A:* < 8 . Optimal 尺 -type designs are also given by Stufken (1987). 
These results are based on a theorem by Majumdar and Notz (1983), which can 
be stated as in the following theorem (Hedayat and Majumdar, 1984). 

Theorem 6.3 For given (t,b,k*) a BTIBD(r, b, k*, u, s\ Xo, A ， i) (see 
Section 6 .5.3.2) is A-optimal if forx = u and z = s the following function is 
minimized: 

幺 (X, z) = (t — l)^[btk*(k* — 1) — {bx + z)(k*t — t -jr k) (bx^ + 2xz + z)] 

+ [k*(bx + x) — (bx 2 + 2xz + z)] 一 1 
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among integers x = 0, 1, •.., [k*/2], z = 0, l,..., b — l, with [A:*/2] the largest 
integer not greater than k*/2, and z positive when x = 0 and z = 0 when x = 
[kV2]. 

A different approach to constructing optimal BTIB designs has been provided 
by Bechhofer and Tamhane (1983a, 1983b, 1985). Their criterion is maximiza¬ 
tion of the confidence coefficient for the joint confidence intervals of prescribed 
allowance (or length) for Tj — ro(j = 1 , 2 ,..., t), namely, for l > 0 , 

P\ = P{rj - to > r}- - i; j = 1 , 2 , (6.31) 

for one-sided confidence intervals, and 

Pi = P{Tj - To - I < Tj - To <tj - j = 1,2,... ,t} (6.32) 

for two-sided confidence intervals. This requires that additional assumptions have 
to be made about the error e。in model (6.2). Assuming that the 句 are i.i.d. 
N(0, a$) and denoting the right-hand sides of (6.17) and (6.18) by a 2 a^ and 
respectively, then expressions for Pi in (6.31) and P2 in (6.32) can be 
written as 



where O (•) denotes the standard normal cumulative distribution function (cdf). 

We note here, parenthetically, that the assumption about the e(j does not agree 
with our underlying framework of randomization theory (see Sections 1.6 and 
1.10). We, therefore, consider the confidence coefficients in (6.33) and (6.34) to 
be approximate confidence coefficients. Nevertheless, the proposed procedures 
of finding optimal designs based on maximizing P\ or P 2 are useful. We note 
also that for design purposes is assumed to be known, based on prior experi¬ 
ences or theoretical subject matter considerations. But as Bechhofer and Tamhane 
(1985) suggest, for analysis purposes an estimate of cr$, based on the analysis 
of variance, should be used in connection with Dunnett’s procedure (Dunnett, 
1955; see also 1.7). 

Tables of optimal BTIB designs for one-sided and two-sided comparisons 
for 2 < ? < 6 ), k* =2,3 and a wide range of values for b and i!G e are given 
by Bechhofer and Tamhane (1985). The procedure is, for a desired confidence 
coefficient, to combine one or more replicates from a table of minimal complete 
sets of generator designs. 
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6.6 ROW-COLUMN DESIGNS 
6.6.1 Introduction 

For purposes of error control or error reduction the principle of blocking is of 
primary importance. So far we have considered only one blocking factor, and 
for most situations this is quite satisfactory. There are, however, situations where 
blocking in more than one dimension may be needed. For example, in an agro¬ 
nomic or forestry trial it is not uncommon to compare a large number of hybrid 
varieties (which represent the treatments), which leads to a large experimental 
area, and it may be important to account for or eliminate the effects of fertility 
trends in the land in two directions. This implies that one needs to block or elim¬ 
inate heterogeneity in two directions. The two blocking systems are referred to 
generally (borrowing terminology from field experiments) as row blocking and 
column blocking, and the resulting designs are referred to as row-column designs. 
Examples of such designs are the Latin square type designs of Chapter 1.10, but 
other designs may be needed with incomplete blocks in both directions. As an 
example consider the following design given by John and Williams (1995) with 
t = 12 treatments in r = 4 rows and c = 9 columns: 



1 

2 

3 

Column 

4 5 6 

7 

8 

9 

1 

4 

8 

5 

9 

11 

3 

2 

1 

7 

2 

12 

11 

2 

3 

5 

10 

4 

8 

6 

3 

7 

10 

9 

4 

6 

5 

1 

12 

8 

4 

11 

9 

12 

6 

1 

7 

10 

3 

2 


Both rows and columns represent incomplete blocks, and the row design and 
column design, which we refer to as the component designs, are characterized 
by their respective incidence matrices, N p and N y say. We shall see later (see 
Section 6.6.2) that the properties of the row-column design depend on N p and 
N y , but that it is not straightforward how to combine the component designs to 
produce a “good” final design. We shall return to this point in Section 6.6.3 after 
discussing the analysis of a row-column design in Section 6.6.2. 

6.6.2 Model and Normal Equations 

Consider a row-column design with r rows, c columns, and t treatments where 
treatment k is replicated times {k = 1,2,..., t). Let yijk denote the observation 
for the experimental unit in the ith row and jth column to which treatment k has 
been applied. Based on the assumption of unit treatment additivity (see Section 
1.10.2.3), a model for ytjk can be derived [despite criticism by Srivastava (1993, 
1996)，and Srivastava and Wang (1998)] and written as 


yijk = M + P/ + Yj U jk 


( 6 . 35 ) 
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where r' =( 厂 i, ， 2 ,.. •, r，) and r 8 = diag(ri, 厂2 , •.. ， 厂 r). 

Similar to Section 1.3.2 we shall derive from (6.40) the reduced NE as follows: 
Using E A = 0 and E 巧 = 0 (as suggested by E P/ = 0 and yj = 0), we 
obtain from the first equation of (6.40) 

^ G r'r 

[l = - 

n 


where pi(i = 1,2,, r), yj(j = 1,2,, c), Tk(k = 1,2,... ,t) represent the 
row, column, and treatment effects, respectively, which are defined such that 
J2 Pi = 0, E y) = 0, E = 0 (see Section 1.10.2.3). The eijk represent the 
combined experimental and observational error. Let us rewrite (6.35) in the usual 
fashion in matrix notation as follows: 

y = fiO + X pp + Xyy + X T x + e (6.36) 

where p, y, r represent the row, column, and treatment effect vectors, respec¬ 
tively, and X p , X y , X T are known matrices linking the row, column, and treat¬ 
ment effects to the observations. If we write, for example, the observation y such 
that the first c components are the observation in row 1, the next c from row 2, 
and so forth, then we have 


of dimension rc x r, and 


of dimension rc x c. Further, the relationships between X p , X y , and X T are 
X f T X p = N p X f T X y = Ny (6.3 


Using (6.37), (6.38), and (6.39), it is then easy to write the NE for model (6.36) as 


( 6 . 



<：! ?r 






n 


(6.41) 
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where we are using the following obvious notation for the right-hand side of 
(6.40): i i i 


(t c y\ 


(G\ 

x' p y 


p 

X' y y 


r 

\Ky) 


v) 


(6.42) 


and rc = n. From the second and third set of the NE (6.40) we obtain 

cp = P- c^ r - N f 0 r 


(6.43) 


and 


ry = T — r^l3 c — N f yX 


(6.44) 


Substituting (6.41) ，（ 6.43)，and (6.44) into the fourth set of equations in (6.40) 
and simplifying, we obtain the RNE for r as 


- -N P N' - -N Y N y ' + —) T = (t - -N p P - -N y T + r -、 


(6.45) 


which we write in abbreviated form as 

Cr= Q 


(6.46) 


with C and Q as defined in (6.45). A solution to (6.46) can be written as 

r = CQ (6.47) 


where C~ is a generalized inverse of C. 

Any statistical inference about the treatment effects is then derived, similar 
to the developments in Chapter 1, from (6.46) and (6.47), in conjunction with 
information from the analysis of variance (see Section 6.6.3). For example, rank 
(C)(<t — 1) determines the number of linearly independent estimable functions, 
c r r, of the treatment effects. For any estimable function c'x we have c'x = c f< c, 
with from (6.47), and 

var(c /1 r) = c r C~ca^ 


6.6.3 Analysis of Variance 

Following general principles (see, e.g” Sections 1.4.11 and 1.3.6), it is easy to 
write out the analysis of variance table for a row-column design that will enable 
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Table 6.2 Analysis of Variance for Row-Column 
Design 


Source 

d.f. 

SS 

x P \o 

r- 1 

ci>,+.+- 歹 … ) 2 
i=l 

X y \D,X p 

c — 1 

j=l 

X r \3,X p ,X y 

t-\ a 

r'Q 

I\0,X p ,X y ,X T 

Difference = ve 

Difference = SS(E) 

Total 

rc — \ 




Uj,k 


a Assuming a connected design. 


us to (i) estimate and (ii) obtain an (approximate) F test for testing H^\ x\ = 
T 2 = • • • = r? = 0. Such an ANOVA table is given in Table 6.2, using an earlier 
established notation. We then have 

SS(E) 

o 2 e = MS(E)=—— 
and 

r _r f Q/(t-l) 

— MS(E) 

provides an approximate test for testing Ho ： ri = t 2 = • • • = = 0. 

It should be mentioned (as is, of course, obvious from the notation used) that 
the ANOVA in Table 6.2 is a sequential (or type I in SAS terminology) ANOVA. 
The nature of the design implies immediately that SS(Z y |3, X p ) = SS(Z K |3), 
and hence the simple expression for SS(Z K |3, X p ) given in Table 6.2. 

6.6.4 An Example 

We shall illustrate the analysis of data from a row-column design, using the 
design of Section 6.1 and using SAS PROC GLM. The data and analysis are 
given in Table 6.3. We make the following comments about the analysis and the 
SAS output in Table 6.3: 

1. The inv-option in the model statement provides a generalized inverse to the 
coefficient matrix of the NE (6.40), and the 12 x 12 submatrix given by the 
rows and columns T\, T 2 , … ， T \2 serve as a C 一 for the RNE (6.46). 
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MORE BLOCK DESIGNS AND BLOCKING STRUCTURES 


2. The e-option in the model statement provides a list of linearly independent 
estimable functions. The output shows that this design is a connected design, 
that is, all treatment comparisons are estimable. 

3. The P-values for R and C should be ignored (see Section 1.10.2.5). 

4. All treatment least-squares means have the same standard error because of 
the same number of replications. 

5. Not all simple treatment comparisons have the same standard error. This has 
to do with the fact that N p and N y represent incomplete block designs. For 
example, treatment 1 occurs together with treatments 7 and 9 three times in 
the same row, whereas it occurs together twice in the same row with all other 
treatments; but treatment 1 never occurs together with treatments 7 and 9 in 
the same column, whereas it occurs together once in the same column with 
all other treatments. Hence there are two different variances and, hence, two 
different standard errors for simple treatment comparisons, for example, 4.81 
and 4.68. We add here, however, that, in spite of their appearance, neither 
N p nor N y represent PBIB(2) designs. 

6.6.5 Regular Row-Column Designs 

As we have already seen in the previous sections, a row-column design consists 
of the amalgamation of two component designs, characterized by N p and N y , 
the row and column designs, respectively. For example, the row-column designs 
in Table 6.4 can be characterized (prior to an appropriate rearrangement of the 
treatments) easily in terms of well-defined error-control designs. We might refer 
to these designs as regular row-column designs. 

6.6.6 Doubly Incomplete Row-Column Designs 

One feature that all the designs in Table 6.4 have in common is that the number 
of columns, that is, the number of experimental units in each row, is equal to 
a multiple of the number of treatments. Of considerable interest, however, are 
designs for which the number of rows and columns is less than the number of 
treatments. A natural way to construct such designs would be to amalgamate two 
PBIB designs. 

Example 6.13 For ? = 8,r = 4, c = 4 Eccleston and Russell (1975) give the 
following design: 

Column 
12 3 4 
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with 入 i = 1 , 入 2 = 2 , 入 3 = 0 , and 


/o 

l 

0 

o\ 


/o 

0 

1 

o\ 


(o 

0 

0 

A 

1 

0 

1 

2 

尸 2 = 

0 

4 

0 

0 

尸 3 = 

0 

4 

0 

0 

0 

1 

0 

0 

1 

0 

0 

0 

0 

0 

0 

1 

\o 

2 

0 

0 


lo 

0 

0 

2 


U 

0 

1 

0 


Table 6.4 Regular Row-Column Designs 


Row Design 

Column Design 

Row-Column 
Design 

Reference 

RCBD 

RCBD 

Latin square 

1.10.2 

RCBD 

GRBD 

Latin square 

1.10.4 

GRBD 

GRBD 

Frequency square 

Hedayat and 

Seiden (1970); 
Hedayat, 
Raghavarao, and 
Seiden (1975) 

RCBD 

BIBD 

Youden square 

I. 10.5 

RCBD 

PBIBD 

Youden square 

1.10.5 

RCBD 

Extended block 

Incomplete Latin 

Cochran and Cox 


design 6 * 

square 

(1957) 

BBD 

BBD* 

Generalized 

Youden design 

Kiefer (1975a, b) 


a See I.9.8.5. 

^See Definition 19.9. 


By inspection we find that both row and column designs are PBIB(3) designs 
with the following association schemes: 

Row design: 

0th Associate 1st Associates 2nd Associates 3rd Associates 


7 8 12 
3, 4，5，6, 


5 6 7 8 


8 12 3 
6，7，8， 1, 
4，5，6，7, 
2, 3, 4, 5, 


12 3 4 


3 4 5 6 
7，8，1，2, 


12 3 4 


4 5 6 7 
2, 3，4， 5, 
8，1，2，3, 
6, 7, 8, 1, 


5 6 7 8 






with 入 1 = 1, 入 2 = 2, 入 3 = 0， and P matrices the same as for the row 
design. □ 

6.6.7 Properties of Row-Column Designs 

We shall restrict ourselves to designs for which each treatment is replicated the 
same number of times, say r。. Moreover, we shall consider a subclass of designs, 
denoted by Do by Shah (1977), with the property that each row of the design 
has ro treatments in common with each column (this holds for all the designs 
discussed so far). In terms of the component design incidence matrices, this 
condition can be expressed as 

N f p Ny = r 0 OrO f c (6.48) 

The condition (6.48) is equivalent to the property of adjusted orthogonality 
for the row-column design (John and Eccleston, 1986). For the type of design 
considered here Eccleston and Russell (1975, 1977) define adjusted orthogonality 
to mean that using model (6.35) the contrast estimators for row effects adjusted 
for treatment effects are uncorrelated with the contrast estimators for column 
effects adjusted for treatment effects. This property is important because it has 
implications with respect to the connectedness of a row-column design. Thus 
Eccleston and Russell (1975) show that a row-column design is connected if it 
satisfies the property of adjusted orthogonality and if both the row and column 
designs are connected. This, in turn, means that for any design in Do its properties 
are determined by the properties of the component designs. 

One important characteristic of an incomplete block design is its efficiency fac¬ 
tor, which is related to the properties of the information matrix C (see 
Section 1.12.2). For the row-column design the information matrix as given 
in (6.37) can be written in terms of the information matrices for the component 
designs as follows: 
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Column design: 

0th Associate 1st Associates 2nd Associates 3rd Associates 


43438787 
2, 1, 2, 1, 6, 5, 6, 5, 


34127856 


88884444 
7，7，7, 7, 3， 3, 3，3, 
6 ’ 6 ’ 6 ’ 6 ’ 2 ’ 2 ’ 2 ’ 2 , 
5 5 5 5 1111 


12345678 
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where C p and C y denote the information matrices for the row and column 
designs, respectively. More specifically, for the class Dq (6.49) can be written as 



In addition, for any design in Dq C satisfies the condition 

C = C p C y /r 0 (6.51) 


(John and Eccleston, 1986). The relationships (6.50) and (6.51), together with 
the fact that C ， C p , and C y have the same eigenvectors and that the eigenvalues 
of C are those of C p or C y or 0 (John and Eccleston, 1986)，it is possible to 
express the efficiency factor E py of a row-column design in Do in terms of 
the efficiency factors E p and E y of the row and column component designs, 
respectively, as 


. _ _ EpEy _ 

PY = E p + Ey- E p Ey 


(6.52) 


(see Eccleston and McGilchrist, 1986; John and Eccleston, 1986). For designs not 
in Do the right-hand side of (6.52) represents an upper bound for E py (Eccleston 
and McGilchrist, 1986). 

We shall illustrate the relationship (6.52) numerically in terms of the design 
of Example 6.13. 


Example 6.13 (Continued) Inspection shows that the design belongs to Do 
with ro = 2. The SAS output in Table 6.5 confirms that the design is connected. 

To obtain E p and E y (which in this case are identical) we use the system of 
equations (4.34) and solve for go, 容 1 ，客 2 , 容 3 to be used in 

var(f^ - z k f) = 2(g 0 - g u )^ 

[see (4.39)] for the three types of associates. We find go = 0, g\ = —g 2 = 
—1with n\ = 4, n 2 = 1 , "3 = 2 , and hence 

av. var(r^ — = f = 1-2857 

Thus, from (1.108) E p = E y = 1/1.2857 = .7778 [we should point out that this 
value is slightly different from what would have been obtained from (4.41)，which 
is defined as the weighted average of the three efficiencies E\, E 2 , E^]. Using 
(6.52) we obtain 

(.7778) 2 

F = _ _ _ _ _ = 6164 

PY 2 x .7778 - (.7778) 2 . 
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Table 6.5 Data and Analysis for Row-Column Design With PBIB (3) Row and 
Column Designs (^ = 8, r = 4, c = 4) 


options nodate pageno=l ； 
data rcpbib ； 
input R C T Y @@; 
datalines ； 

111 20 122 23 135 25 146 21 

213 30 224 29 237 25 248 31 

317 34 326 35 332 30 343 35 

415 33 428 38 434 34 441 31 

/ 

run ； 

proc print data=rcpbib; 
titlel 'TABLE 6.5'; 

title2 'DATA FOR ROW-COLUMN DESIGN'; 
title3 'W/PBIB(3) ROW AND COLUMN DESIGNS'; 
title4 't=8, r=4, c=4 1 ； 
run ； 


proc glm data=rcpbib ； 

class R C T ； 

model Y=C R T/e ； 

estimate 1 1-2 1 T 1 - 1 ； 

estimate 1 1-3' T 1 0 -1; 

estimate 1 1-4 1 T 1 0 0 -1; 

estimate 1 1-5' T 1 0 0 0 -1 ； 

estimate 1 1-6 1 T 1 0 0 0 0 -1 ； 

estimate '1-7' T 1 0 0 0 0 0 -1 ； 

estimate '1-8' T1000000 -1 ； 

title2 'ANALYSIS OF DATA'; 

title3 'FROM ROW-COLUMN DESIGN'; 

title4 'W/SIMPLE TREATMENT CONTRASTS'; 

run ； 

Obs R C T 

1111 

2 12 2 

3 13 5 

4 14 6 

5 2 13 

6 2 2 4 

7 2 3 7 

8 2 4 8 

9 3 17 

10 3 2 6 

11 3 3 2 

12 3 4 3 


Y 

20 

23 

25 

21 

30 

29 
25 

31 

34 

35 

30 
35 
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Table 6.5 (Continued) 


13 4 1 5 

14 4 2 8 

15 4 3 4 

16 4 4 1 


33 
38 

34 
31 


ANALYSIS OF DATA FROM ROW-COLUMN 
DESIGN W/SIMPLE TREATMENT CONTRASTS 


The GLM Procedure 


Class Level Information 


Class Levels Values 


R 

C 

T 


4 12 3 4 

4 12 3 4 

8 12345678 


Number of observations 16 


General Form of Estimable Functions 


Effect 


Coefficients 

Intercept 


LI 

C 

1 

L2 

C 

2 

L3 

C 

3 

L4 

C 

4 

L1-L2-L3-L4 

R 

1 

L6 

R 

2 

L7 

R 

3 

L8 

R 

4 

L1-L6-L7-L8 

T 

1 

L10 

T 

2 

Lll 

T 

3 

L12 

T 

4 

L13 

T 

5 

L14 

T 

6 

L15 

T 

7 

L16 


T 8 L1-L10-L11-L12-L13-L14-L15-L16 


Dependent Variable : Y 
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Table 6.5 (Continued) 


Sum of 


Source 


DF Squares 

Mean Square 

F 

Value 

Pr > F 

Model 


13 426.5000000 

32.8076923 


7.09 

0.1302 

Error 


2 9.2500000 

4.6250000 




Corrected 






Total 


15 435.7500000 






R-Square Coeff Var Root MSE 


Y Mean 



0.978772 7.259346 2.150581 

29.62500 


Source 

DF 

Type I SS 

Mean Square 

F 

Value 

Pr > F 

C 

3 

16.2500000 

5.4166667 


1.17 

0.4913 

R 

3 

357.2500000 

119.0833333 


25.75 

0.0376 

T 

7 

53.0000000 

7.5714286 


1.64 

0.4305 

Source 

DF 

Type III SS 

Mean Square 

F 

Value 

Pr > F 

C 

3 

10.2500000 

3.4166667 


0.74 

0.6189 

R 

3 

285.5000000 

95.1666667 


20.58 

0.0467 

T 

7 

53.0000000 

7.5714286 


1.64 

0.4305 




Standard 




Parameter 

Estimate 

Error t 

Value 

Pr > |t| 

1-2 


0.50000000 

2.84495167 

0 

.18 

0.8767 

1-3 


-4.75000000 

2.63391344 

-1 

.80 

0.2131 

1-4 


-3.25000000 

2.84495167 

-1 

• 14 

0.3716 

1-5 


-3.50000000 

2.40442301 

-1 

.46 

0.2828 

1-6 

1-7 


-1.00000000 

2.63391344 

-0 

• 38 

0.7407 


-1.75000000 

2.84495167 

-0 

.62 

0.6011 

1-8 


-6.25000000 

2.63391344 

-2 

.37 

0.1410 


This value can be confirmed by making use of the SAS output in Table 6.5, 
in particular the standard errors of simple treatment contrasts and the fact that, 
because of the combinatorial structure of the design, there are three different 
standard errors with frequency 3, 3, and 1 ， respectively. Thus, using the general 
definition of E given in (1.108 )， 



2/2 

[3 x (2.8449) 2 + 3 x (2.6339) 2 + (2.4044) 2 ]/(7 x 4.625) 


= 0.6364 


It follows, of course, from (6.52) that if, say, E p = l, then E py = E y . 


□ 
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6.6.8 Construction 

From our discussion in the previous sections concerning the information matrix 
and the efficiency factor of a row-column design, it comes as no surprise that 
the construction of an efficient row-column design starts with efficient row and 
column component designs. This is, of course, easy to do for regular row-column 
designs for which at least one of the component designs has efficiency equal to 
1. There are, however, no general rules for amalgamating two PBIB designs. 
Rather, we shall describe one construction method due to John and Eccleston 
(1986), which is based on the generalized cyclic incomplete block designs of 
Jarrett and Hall (1978) (see Section 6.3). 

To construct a design for ? = r x 5 treatments such that each treatment is 
replicated ro times (with ro < r) in r rows and c = ros columns, we consider 
the r residue classes Si = [i, / + r,..., / + r{s — 1)} for / = 0, 1 ,..., r — 1 , and 
form an r x ro array where each column contains one element from each of the r 
residue classes Si. We shall refer to this array as the generator array. From each 
such column we generate 5 — 1 further columns by adding successively r mod t 
to each element in that column. Thus each initial column generates a complete 
replicate of the t treatments, and the columns of the r x r^s array represent the 
blocks of the generalized cyclic incomplete block design with ro replications for 
each treatment. 

In order to obtain a design in Dq we need to ensure that each row in the 
generator array contains at most one element from each residue class. Then each 
row will consist of elements from ro residue classes and hence each row and 
each column will have exactly ro elements in common. Thus the final design 
satisfies the condition of adjusted orthogonality. 

To illustrate this method we consider the following example. 

Example 6.14 Suppose ? = 12 = 3 x 4, r = 3, 5 1 = 4, ro = 2, c = 8. The 
residue classes are 

5 0 = {0, 3, 6, 9} 5i ={1,4,7,10} & 5, 8, 11} 

Choose the generator array as 

0 7 

1 11 

2 3 

Then the final row-column design is 

Column 



1 

2 

3 

4 

5 

6 

7 

8 

1 

0 

3 

6 

9 

7 

10 

1 

4 

2 

1 

4 

7 

10 

11 

2 

5 

8 

3 

2 

5 

8 

11 

3 

6 

9 

0 
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Closer inspection shows that both row and column designs are actually PBIB(2) 
designs. □ 

As pointed out by John and Williams (1995)，the method described above 
may not always lead to the most efficient row-column designs. Alternatively, 
computer algorithms have been proposed that rely on iterative improvements of 
a starting design or a starting component design by using exchange algorithms. 
The aim is to construct a design with as high an efficiency factor as possible. 
The criterion used most often in this context is that of (M, S) optimality since it 
is most amenable to computational procedures. We shall not go into details, but 
rather refer the reader to the appropriate literature, such as Russell, Eccleston, and 
Knudsen (1981)，Nguyen and Williams (1993)，Venables and Eccleston (1993 )， 
and John and Williams (1995). 


6.6.9 Resolvable Row-Column Designs 

It is sometimes desirable, especially when the number of treatments is large or 
when the experiment is done sequentially, to have the treatments arranged in a 
resolvable design. This means that the design consists of several replicates such 
that each replicate constitutes a row-column design and each treatment occurs 
exactly once in each replicate. An example of a resolvable design for t = c 2 are 
the lattice square design of Yates (1940b) (see Chapter 18). 

For a resolvable row-column design with R distinct replications model (6.1) 
needs to be changed to 


yijkt = ^i-\- Ri -\- Pij + yik + Q + eiju (6.53) 

with i = 1,2, ... ， R., j = 1 ， 2, ... ， r; 众 = l, 2,..., c; l = 1,2,... and where 
Ri are the replicate effects, p", are the row and column effects, respectively, 
nested within replicates, and xn are the treatment effects. Because of the nesting 
structure these designs are a special case of what has been referred to as nested 
row-column designs (see, e.g., Morgan, 1996; John and Williams, 1995). 

In matrix notation model (6.53) can be written as 

y = fiO + X j^R + Xpp + ^-yY + X T x + e (6.54) 

where, in contrast to (6.2), X p is of dimension n x Rr, and X y is of dimen¬ 
sion n x Rc, where n = Rrc = Rt. Using (6.54) we can write out the normal 
equations and then the reduced normal equations. In terms of the design matrices 
in (6.54) the information matrix can be written as (see Morgan and Bailey, 2000) 

C = X' r (l-^- X p X' p - i XyX' y + -i- (6.55) 
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With X f T X p = Np, X T X y = N y , and X^Xr = Nr we can rewrite (6.55) as 

C = RI — -N P N' - -N Y N' + - N r N' r 

c y r Y rc 

or, since the replicate design is an RCBD, 

C = RI - -N„N f n - -NyN r v + — 33 r (6.56) 

c p r y rc 

When constructing resolvable row-column designs care must be taken to 
ensure that rank (C) = t — l. This holds for the method of constructing lattice 
square designs (see Chapter 18). The method described in Section 6.6.7 can be 
used here also in conjunction with the interchange algorithm of Nguyen and 
Williams (1993) to produce connected and efficient designs. 

Variations of the model (6.53) may be considered as dictated by the experimen¬ 
tal conditions. For example, if in an industrial experiment the rows are different 
batches of raw material, the columns are different machines, the replicates are 
different times, and the experiment is conducted in the same factory, then the 
machines may be the same for the different replicates. Or, in an agriculture the 
replicates may be contiguous pieces of land so that, for example, the columns are 
the same for all the replicates. In those cases the columns are no longer nested in 
replicates but rather crossed with replicates, and hence model (6.53) changes to 

yijkt = M + ^/ + Pij + H Q + eiju (6.57) 

(this is similar to the various forms of replicated Latin squares in 1.10.3). 

Finally, a further modification, or restriction, of the above design is considered 
by John and Williams (1995). In addition to having each treatment occur once 
in each replicate, each treatment should not occur more than once (or nearly 
equally often) in each column as, for example, in the design given below for 
t = 12 = 3 x 4, i? = 3, r = 3, c = 4: 


Replicate 

1 

0 

3 

6 

9 


1 

4 

7 

10 


2 

5 

8 

11 

2 

4 

8 

9 

0 


5 

6 

10 

1 


3 

7 

11 

2 

3 

7 

10 

0 

5 


8 

11 

1 

3 


4 

9 

2 

4 
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This arrangement may be useful in, for example, an agronomic experiment to pro¬ 
vide some protection against the loss of an entire column. This design is referred 
to as a Latinized row-column design (using terminology of Harshbarger and 
Davis, 1952). John and Williams (1995) recommend to modify model (6.57) to 


yi jkt = M + Ri + Pij + Kfe + (Ry)ik + Q + Cijkt (6.58) 

so that the effects Ri and (Ry)ik account for the R(c — 1) d.f. of the yij in model 
(6.53). Obviously, subject matter knowledge may determine whether model (6.53) 
or (6.57) is appropriate. 

From the above discussion it should be clear that the original row-column 
design may have to be modified in various ways to accommodate practical con¬ 
siderations and requirements, leading to still other forms of row-column designs. 



CHAPTER 7 


Two-Level Factorial Designs 


7.1 INTRODUCTION 

In the preceding chapters we have discussed various aspects of error control 
designs. Another component of an experimental design is, however, the treat¬ 
ment design (see Section 1.2.2.3)，in particular, designs with a factorial treatment 
structure (see LI 1.1 and 1.11.2). 

In this chapter we shall consider in great detail the situation where an exper¬ 
iment involves several factors, each having two levels. Such experiments are 
generally referred to as 2 n factorials, where n denotes the number of fac¬ 
tors, and hence 2 n denotes the total number of level combinations or treatment 
combinations. 

Although the 2 n factorial represents only a special case of the general p n 
factorial experiment, that is, n factors with p levels each, it deserves spe¬ 
cial consideration because of its practical importance and its special algebraic 
and combinatorial representation. At the same time, however, we shall use the 
2 n case to lay the foundation for the discussion of the more general case by 
introducing appropriate notation and mathematics suitable for generalization in 
later chapters. 


7.2 CASE OF TWO FACTORS 

7.2.1 Definition of Main Effects and Interaction 

Let us denote the two factors by A and B, where factor A has levels ao and a\, 
and factor B has levels bo and b\. We shall refer to ao and bo as 0 levels and to 
a\ and b\ as 1 levels of the two factors, respectively. The four possible treatment 
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combinations may be represented as 

aobo 

a\bo 

aobi 

a\b\ 


(7.1) 


We can then define the effect of factor A, that is, the effect of changing A from 
ao to a\, at each of the two levels of factor B. Denoting these effects by A (bo) 
and A(b\), respectively, we define 


A(b 0 ) = aib 0 -a 0 bo 2 ) 

A(b\) = a\b\ — aob\ 

where aibj now denotes the true response of the treatment combination aibj. We 
shall use this dual meaning of aibj throughout, but from the context it should 
always be clear whether we mean the treatment combination or the response of 
that treatment combination. The effect of A, also referred to as the main effect 
A, is now defined as the average of the so-called simple effects A (bo) and A{b\) 
of (7.2), denoted by A, that is, 

A ^ \[A{b 0 ) + A{b x )} (7.3) 

or, by using (7.2), 

A = \[a\bo - a^bo + a\b\ - aob\] (7.4) 

or, symbolically, 

A = — ao)(b\ + bo) (7.5) 

where the expression is to be expanded algebraically. 

If the two factors are acting independently, we would expect A (bo) and A{b\) 
of (7.2) to be equal, but, in general they will be different, and their difference is a 
measure of the extent to which the two factors A and B interact. This interaction 
is denoted by AB (or A x B), and defined as 

AB = \[A(b x ) - A(b^ (7 6) 

AB = \[a\b\ — aob\ — a\bo + aobo] 

or, symbolically, 

AB = ^(a\ — aQ)(b\ — bo) (7.7) 
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The factor ^ here is a matter of convention and is used to express the interaction 
effect on a per-unit-difference basis, there being two differences in (7.6). We 
shall use this convention throughout. 

Along the same lines we may also obtain the effect of factor B by first defining 
the simple effects: 


B(ao) = aob\ - aobo 
B{a\) = a\b\ — a\bo 

and then 

B = ^[B(ao) + 

=\[aob\ - aobo + a\b\ - a\bo] 

= + cio)(b\ — bo) 

B(ao) and B(a\) from (7.8) can also be used to define the interaction between 
factors B and A as 


(7.8) 


(7.9) 


BA= - B(a 0 )] 

=\\a\b\ - a\bo - aob\ + aobo] 

= — ao)(b\ — bo) = AB 

which shows that in defining the interaction we need not bother with the order 
in which we write down the letters. 


7.2.2 Orthogonal Contrasts 

We note that the effects A, B and the interaction AB are three mutually orthog¬ 
onal contrasts of the true responses of the four treatment combinations: 



aobo 

a\bo aobi 

a\b\ 

2A 

—— 

+ 

+ 

2B 

— 

+ 

+ 

2AB 

+ 

— — 

+ 


which follows immediately from (7.4), (7.9), and (7.6). If we denote the mean 
response of the four treatment combinations by M ， we have the following trans¬ 
formation of the response vector (aobo, a\bo, a^b\, a\b\)'\ 


/4M\ 

2A 

2B 

k 2ABj 




1 


-1 




a\bo 

aob\ 

\a\b\) 


(7.10) 
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It is interesting to note that the transformation matrix in (7.10) can be written as 
the Kronecker product 


where the matrix 



(7.11) 


can be considered as the transformation matrix for the one factor situation, that 

is, 


(TH-： ：)(：) 


Hence, by using (7.11), we can write (7.10) as 

/4M\ /a 0 bo\ 


2A 

2B 

k 2ABj 


=(T xT) 


a\bo 

aob\ 

\ci\b\J 


(7.12) 


7.2.3 Parameterizations of Treatment Responses 

Relationship (7.10) and hence (7.12) may be inverted, that is, 


kobo' 

a\bo 

aob\ 

\ciib\J 


/ 4M\ 

= ( rxr )— 1 K 

k 2ABj 


and since (T x T)~ l = T~ l x T~ l and 



we obtain 


/ 1 

a\bo _ 1 1 

aob\ ~ 4 1 

\a\b\) \ 1 



-1 

-1 

1 

1 



/4M\ 

2A 

2B 

k 2ABj 


(7.13) 
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that is, 


ci^bo = M — ^ A — 士 + ^AB 

d\b() = M -h i-A — i B — hAB 

2 2 2 (7.14) 

ciQb\ = Ad — 士 A + 士 B — ^AB 

o,\b\ = M 

Parameterizations of type (7.14) will become very useful in later discussions 
of the 2 n factorial. At this point it may be of interest to relate representation 
(7.14) to the usual parameterization of a factorial structure in linear model terms. 
If /iij denotes the true response of the treatment combination aibj, the usual 
parametric model (see Section 1.4.13.2) is given as 

fMj = M + 的 + A/ + ㈣、 ij (7.15) 


It is obvious that we cannot relate M, A, B, AB uniquely to the parameters in 
(7.15) without defining them more precisely. For that reason we write (7.15) as 
the identity 


^ij = M.. + (Mi. — + (M.y — M..) + (f^ij ~ ^i.~ + M..) 

where, in the present context with /, j = 0, l, 

M/. = + Mn) (’ = 0 ， 1) 

Kj = 圣(陶 + Ml；) (j = 0, 1) 

M.. = \ ^ij = M 
U 


so that 


with 


= M/. M.. 

Pj = f^.j ~ M.. 

(o^)ij = fMj —lh —Vij +1L 


ao + oi\ = 0 

A) + A = 0 

_oo + _io = 0 
⑽ ) 01 + (of^)n = 0 
⑽ )oo + ⑽ )oi = 0 
⑽） 10 + (q^) u =0 


(7.16) 


(7.17) 
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It follows then from (7.10) that 

M = /I 
A = a\ — ao 
B = - Po 

AB = \ [(aP)oo -⑽ )io _ (aP)oi + ⑽ )ii] 
or, using the relationships (7.17), 


M = /i 
A = 2a i 
B = 

AB = 2(ap)n 


that is, 


(2M\ 


f /x 、 

A 

= 2 

Oil 

B 

Pi 

\ AB ) 


〈(Oil/ 


Hence we obtain from (7.14) 


aobo = /x - + (aP)n 

ci\bo = /x + ai - - (ap)n 

cioh = /x -ofi + ^i - (ap)n 
a\b\ = /x + + A + (of/3)n 


This again can be expressed as a linear transformation using the matrix T of 
(7.11) as ^ 


Hence 


ci\bo 

aoh 

\a\b\) 


= 4(T x T)~ l 



A 

\(aP)nJ 


’ fi 、 
Oil 
Pi 


\(ap)nj 


=\(T x T) 


’a 0 办 o 、 

a\bo 

aob\ 

\a\b\) 
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We can, therefore, easily relate the parameters customarily used in factorial 
experiments to those commonly used in classificatory linear models and relate 
both, in turn, to the true responses. 


7.2.4 Alternative Representation of Treatment Combinations, 
Main Effects, and Interaction 


We now return to Eqs. (7.10) and (7.14) using two alternative ways of repre¬ 
senting the treatment combinations and hence the responses, one that is peculiar 
to the 2 n factorial and simplifies the previous notation in a very useful way, 
and one that leads to a more formal, mathematical definition of main effects and 
interaction, which can be generalized for other factorials as well. 

Instead of writing the treatment combinations as in (7.1), we shall now write 
them as (1) a, b, ab, respectively, that is, in each combination the letter corre¬ 
sponding to a 0 level is replaced by 1 and the subscript is dropped in a letter 
corresponding to a 1 level. With this notation (7.5), (7.9), and (7.7) can be written 
formally as 


A = ^(a — 1)(Z? + 1) 

B = \{a+l){b - 1) (7.18) 

AB = \(a - \){b - 1) 


each of which is then multiplied out as if these were mathematical quantities. We 
note that in the expressions above terms like (a — 1) do not have any meaning 
in themselves, only the quantities (1) a ， b ， ab after multiplication are meaning¬ 
ful. A simple rule for writing down the expressions in (7.18) is as follows: 
A minus sign appears in any factor on the right if the corresponding letter is 
present on the left, otherwise a plus sign appears. It is crucial in this nota¬ 
tion to adhere to the rule that we have used, that effects and interactions are 
denoted by capital letters and treatment combinations and responses are denoted 
by small letters. 

Still another way of representing the treatment combinations is by pairs 
(x\,X 2 ), where x\ represents the level of factor A and X 2 represents the level 
of factor B, with xi =0, 1(/ = 1,2). The four treatment combinations are then 
written as (0, 0), (1, 0), (0, 1), (1, 1). By looking at (7.10) we notice that, apart 
from a constant, main effect A is a comparison of treatment combinations with 
x\ = \ versus those with x\ = 0. Similarly, 5 is a comparison of treatment com¬ 
binations with X 2 = 1 versus those with ^2 = 0. And finally, AB is a comparison 
of treatment combinations satisfying x\ X 2 = 0 mod 2, that is, (0, 0) and (1, 1), 
versus those satisfying x\ X 2 = 1, that is, (1 ， 0) and (0 ， 1). Each of the three 
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basic equations for 

A: x\ = i 
B : = j 

AB: x\ X2 = k 

where /, j, k take on the values 0 and 1, and where the equations are always 
reduced mod 2, represents a partitioning of the set of four treatment combinations 
into two sets of two treatment combinations each, that is, 


xi = 0: (0, 0), (0, 1) 

=1： (1,0), (1,1) 

^2 = 0 : ( 0 , 0 ),( 1 , 0 ) 

= 1:(0, 1)，（1，1) 

义 l + 又 2 = 0: (0, 0)，（1, 1) 

= 1 :( 1 , 0 ),( 0 , 1 ) 

A unified way to express these partitions is to write them as 

a\X\ + a 2 X 2 = 0, 1 mod 2 


where a\,a 2 = 0, 1 but (a\, 0^2) ^ (0, 0). These partitions are orthogonal in the 
sense that for a given a\x\ + 0 , 2 x 2 = i the set of treatment combinations contains 
one from a[x\ + = 0 and one from = 1 where (o^, a f 2 ) ^ 

(ai, Qf2). This, of course, is equivalent to our earlier statement that A, B and AB 
represent orthogonal comparisons of the treatment combination responses. 


7.3 CASE OF THREE FACTORS 

We now consider three factors A, 5, and C, say, with levels ao, a\, bo, b\, co, c\, 
respectively. The eight possible treatment combinations can be represented, in 
the standard order, in the following three ways: 


aoboco 

(1) 

(0, 0 , 0) 

ci\boco 

a 

(1,0,0) 

ciobico 

b 

(0,1,0) 

a\b\co 

ab 

(1 ， 1,0) 


or 

or 

aoboci 

c 

(0, 0’ 1) 

aiboci 

ac 

(1,0,1) 

aohci 

be 

(0’ 1,1) 

a\b\C\ 

abc 

(1,1,1) 
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7.3.1 Definition of Main Effects and Interactions 

Denoting the true response by the same symbol as the treatment combination, 
we define now four simple effects associated with factor A: 

A(bo,c 0 ) = aib 0 co - a 0 boc 0 

A(b u c 0 ) = aibico - a 0 bic 0 ^ ^ 

M^o, c\) = aiboc\ - aoboci 

A(b\, c\) = a\b\c\ — aob\c\ 

where A(bi, cj) is the effect of factor A with factor B at level bi(i = 0, 1) and 
factor C at level Cj(j = 0, 1). The average of the four simple effects of (7.19) 
is then defined as the effect of factor A, denoted by A, that is, 

^ = I A(bi,Cj) 
ij 

=\(a\boco + a\b\co + a\boc\ + a\b\c\ 

—aobQCo — aobico — aoboc\ — aob\c\) 

or, symbolically, 

^ = \(m\ ~ ao) ( 办 l + ^o)(^i + ^o) (7.20) 

and using the alternative notation 

A = ^(a ab ac + abc — (l) — b — c — be) 

=\(a - l)(b + l)(c + l) (7.21) 

Next we define the interaction between factors A and B, denoted by AB, by 
considering the difference between the following average effects: 

A(b 0 ,c) = j[A(bo, co) A(b 0 , ci)] 

and 

A(bi,c) = j[A(b\, co) A(b\, ci)] 

Then 

AB = ^[A(bi,c) - A(b 0 ,c)] 

=\{a\b\co - aob\CQ + a\b\c\ - a^b\c\ 

-a\boco + aoboco - a\b^c\ + aoboc\) 

= i(ai - a 0 )(bi - b 0 )(ci + c 0 ) 


(7.22) 
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or, using the alternative notation 

AB = \\ab + abc (l) c — a — b — ac — be] 

=\(a - l)(b - l)(c + l) (7.23) 


We can also evaluate the simple interaction between factors A and B for each 
level of factor C, using an obvious modification of (7.6), as 

AB(co) = \{a\b\co + aoboco - a\boco - aob\co) (7.24) 


and 


AB(c\) = \{a\b\c\ + a^boci — a\boc\ — aob\c\) (7.25) 

Obviously, 

AB = \[AB(c 0 ) + AB( Cl )] 

The two interactions given by (7.24) and (7.25) may be different, and, as a 
measure of the extent to which they are different, we define the three-factor 
interaction of factors A, B, and C, denoted by ABC, as 

ABC = \[AB( Cl )-AB(co)] 

=\(a\bic\ + aoboci - a\boc\ - aob\C\ 

一 a\b\co - aoboco + a\boco + aob\co) 

=|(ai — ^0)(^1 — ^0)(^1 — <^0) ( 7 . 26 ) 


ABC = \yabc - a b -\- c — (l) — ab — ac — be] 

=\{a — l)(b — l)(c - 1) (7.27) 

Similar arguments can be used to define the remaining main effects and two-factor 
interactions. We summarize them below: 

B = + ao ) (办 1 — bo)(c\ + co) 

C = \(ai + a 0 )(bi + b 0 )(ci - c 0 ) 

AC = ^(a\ — ao)(b\ + bo)(c\ — co) 

BC = ^{a\ + ao){b\ — bo){c\ — cq) 
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or alternatively, 

5 = I(a+l)(^-l)(c+l) 
C = \(a + l)(b + l)(c - l) 
AC = \{a - \){b + \)(c - \) 
BC ^ \{a + \){b - \)(c - \) 


Just as in the two-factor case we can express the effects and interactions defined 
above as a transformation of the treatment responses: 


8M - 


_ 1 

1 

1 

1 

1 

1 

1 

1 - 


~aoboCQ~ 

4A 


-1 

1 

-1 

1 

-1 

1 

-1 

1 


aiboco 

4B 


-1 

-1 

1 

1 

-1 

-1 

1 

1 


dob\co 

4AB 


1 

-1 

-1 

1 

1 

-1 

-1 

1 


a \ b\CQ 

4C 


-1 

-1 

-1 

-1 

1 

1 

1 

1 


ao 厶 oci 

4AC 


1 

-1 

1 

-1 

-1 

1 

-1 

1 


a\boci 

ABC 


1 

1 

-1 

-1 

-1 

-1 

1 

1 


aob\c\ 

4ABC_ 


_-l 

1 

1 

-1 

1 

-1 

-1 

1 _ 


.a\b\c\_ 


(7.28) 

where M is the mean response of the 8 treatment combinations. We shall com¬ 
ment briefly on the transformation matrix in (7.28): (i) It can be verified easily 
that, with the standard order of writing the symbols for the treatment combina¬ 
tions and the effects (main effects and interactions) that we have adopted, the 
matrix can be expressed as the threefold Kronecker product T x T x T, where 
T is given by (7.11); (ii) the rows of the transformation matrix are orthogo¬ 
nal to each other; (iii) except for the first row, the rows of the transformation 
matrix represent comparisons of the treatment responses; (iv) it is easy to write 
down the elements in any given row: for A, B, and C we have a +1 if the 
corresponding factor appears at the 1 level in a treatment combination, and a 
—1 if the corresponding factor appears at the 0 level; for interactions we just 
multiply the corresponding elements for the main effects involved; for the alter¬ 
nate representation of treatment combinations this rule can be formulated as 
follows: Let X be any effect symbol; if X consists of an odd (even) number 
of factor symbols, then an element in the row corresponding to X is +1 if the 
associated treatment combination has an odd (even) number of letters in com¬ 
mon with X, and it is — 1 if the associated treatment combination has an even 
(odd) number of letters in common with X, where zero is considered to be an 
even number. 
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7.3.2 Parameterization of Treatment Responses 

Inverting (7.28) leads to a representation of treatment responses in terms of the 
overall mean, main effects, and interactions: 


aoboco~ 


一 1 

-1 

-1 

1 

-1 

1 

1 

-1 _ 


- 8 M 一 

ciiboco 


1 

1 

-1 

-1 

-1 

-1 

1 

1 


4A 

ciobico 


1 

-1 

1 

-1 

-1 

1 

-1 

1 


4B 

a\b\co 


1 

1 

1 

1 

-1 

-1 

-1 

-1 


4AB 

cioboci 


1 

-1 

-1 

1 

1 

-1 

-1 

1 


4C 

a\boc\ 


1 

1 

-1 

-1 

1 

1 

-1 

-1 


4AC 

aob\c\ 


1 

-1 

1 

-1 

1 

-1 

1 

-1 


4BC 

d\b\c\_ 


_ 1 

1 

1 

1 

1 

1 

1 

1 _ 


_4ABC_ 


(7.29) 

The rule for writing down the elements in the matrix in (7.29) corresponding to 
the treatment combination aibjCk is: The coefficient for M is +1; the coefficient 
for the main effects is +1 if the corresponding factor appears at the 1 level and 
—1 if it appears at the 0 level; the coefficient for an interaction is the product 
of the coefficients for the main effects involved. An alternative way of writing 
(7.29) then is 

latbjCk = 2M + (-1) 1_1 A + (-l) 1- - 7 ^ + (-1) [1_0+(1_7)] AB 
+ {-\) x ~ k C + (-l) [(1_0+(1_t) Uc 

+ (—l) [( 1 _j )+( 1 - 幻 ] 5C + (—l/G-G+G-P+d -幻 U5C (7.30) 

where /, j,k = 0, 1 . 

7.3.3 The x - Representation 

To conclude this section, we shall discuss briefly the formal definition of main 
effects and interactions in terms of orthogonal partitions, using what we shall 
henceforth call the x representation of treatment combinations. A treatment 
combination is given by = (x\,X 2 , x^) with Xi = 0, 1( / = 1, 2, 3). We then 
consider all partitions of the form 

a\x\ + a ， 2X2 + o?3X3 = 0 , 1 mod 2 

where a/ = 0, 1( / = 1, 2, 3) but («i, « 2 , 0 ^ 3 ) ^ (0, 0, 0). Each partition divides 
the set of treatment combinations into two sets of four treatment combinations 
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each. The comparison of the average response for these two sets defines then a 
main effect or interaction. Specifically these sets are 


A: 

Oil 

= 1,«2 

=Qf3 = 0 

i.e. xi = 0 

VS. X\ = \ 

B: 

0^2 

= 1，《1 

=Qf3 = 0 

i.e. X 2 = 0 

VS. X 2 = 1 

AB: 

Oil 

= 0 i 2 = 

1 , Qf 3 = 0 

i.e. x\ X 2 = 0 
VS. X\ X 2 = 1 

C: 

(X3 

= 1 肩 

=Qf 2 = 0 

i.e. X 3 = 0 

VS. X 3 = 1 

AC: 


=«3 = 

1 , Qf 2 = 0 

i.e. xi + X 3 = 0 
VS. Xi + X 3 = 1 

BC: 

Oil 

= 0 , Qf 2 

=Qf3 = 1 

i.e. X 2 + X 3 = 0 

VS. X 2 + X 3 = 1 

ABC: 

0^1 

= Oi 2 = 

^3 = 1 

i.e. x\ X 2 ~\~ ^3 

VS. Xi + X2 + X 3 


where every equation is reduced mod 2. We refer to each a = (ai, c^, as a 
partition of the factor space X = {x = (x\,X 2 , X 3 )’； x/ = 0, 1; / = 1, 2, 3}. 


7.4 GENERAL CASE 

We now have n factors and hence 2 n treatment combinations. It is convenient 
to denote the factors by A\, A 2 ,..., A n where factor A/ has levels a/o and 
an(i = 1,2,..., n), or simply x/ = 0 and 1. Then a treatment combination can 
be represented again in three different forms: ( 1 ) the explicit form: 

a hi a2 h •. an jn 

where ji = 0 or 1 (/ = 1,2,, n); ( 2 ) the condensed form: 

a^a^ 2 - - - a^ n (7.31) 

where Xi = 0 or 1 with = 1 and a] = at and all unity terms in (7.31) are 
ignored; (3) the x representation 

, X2, • . . ， X-n) 

where, as in ( 2 ), x/ = 0 or 1 . 
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7.4.1 Definition of Main Effects and Interactions 

Conforming to the general expression for a partition of the 2 n treatment combi¬ 
nations into two sets of 2 W — 1 treatment combinations each, that is, 

a\x\ + a 2 X 2 H - h ot n x n = 0, 1 mod 2 (7.32) 

with oti =0, l,Xi =0, 1(/ = 1 , 2 ,..., n), we write the effects and interactions 
in the general form 

i A ? 1 % 2 ….々 （ 7.33) 

with = 1, A? = Ai, and we ignore all unity elements in this expression. If all 
oti = 0, then (7.33) is written as M ，the mean of the 2 n treatment combinations. 

Using (7.31) and (7.33) we now generalize the symbolic expressions given in 
the previous sections for main effects and interactions and write 

A^A ^ 2 ... Ap = ^ + (-in (7.34) 

i=l 

and 

We know that in (7.34) 2 n ~ l treatment combinations enter positively into 
A^ 1 A 0 ^ - - - An n and the remaining 2 n ~ l treatment combinations enter negatively. 
For the case ^ = 3 we have given an even-odd rule to determine which treatment 
combinations enter positively and which enter negatively, a rule due to Fisher 
(1949). We shall use (7.34) to derive such a rule for the general case. 

First we rewrite (7.34) as 

« 2 … W = ▲ flK 1 + (-1)'。] (7-35) 

We then note that each treatment combination contains either aj or a^{i = 
1,2,..., n). To determine the sign of a^a^ 2 - - - we therefore have to deter¬ 
mine the sign with which a? enters into this “product ”： If a/ = 0, then the sign 
for a? is +1 whether x, = 0 or 1; if at = 1, then the sign for a? is +1 for 
xi = 1 and —1 for Xi = 0 as is obvious from (7.35). Both statements can be 
combined into the statement that the sign for is (—lfG - 々 ) .Hence the sign 
of - - - an n is 


]~[[(_1 广 (1 一又 l} ] = (-i)Ew(i - 々） =(7.36) 



GENERAL CASE 


255 


Now ^2 1 at is the number of letters in A ^ 1 A^ 2 - - - A^ n after deletion of terms 
of the form A°j. Also, = 1 if at and A/ are present in (7.31) and (7.33), 
respectively. With (7.36) this establishes then the following rules. 

Rule 1 The treatment combination a^a^ 2 - - - enters positively into 
A ^ 1 A 0 ^ - - - A^ n if 8 = [number of letters in A ^ 1 A^ 2 - - - A^ n — number of letters 
in common between A^ 1 A^ 2 - - - A^ n and a^a^ 2 - - - is even, and it enters 
negatively if <5 is odd. 

This leads immediately to the alternative formulation in rule V. 

Rule V If A ^ 1 A^ 2 - - - A^ n has an odd (even) number of letters, then the 
treatment combination a^a^ 2 - - - enters positively into A ? 1 Ag 2 … A^ n if it 
has an odd (even) number of letters in common with A^A^ 2 - - - A^ n , and it 
enters negatively if it has an even (odd) number of letters in common with 

Based on the method of orthogonal partitioning as given in (7.32) and rule 
1 ’， we can give yet another expression for A^ 1 A^ 2 - - - A^ n that will prove to be 
useful for further discussion of the general factorial experiment. We know that 
A^A^ 2 - - - An n is, apart from a constant, a comparison of responses of treatment 
combinations satisfying the equation 


a\x\ + a 2 X 2 + -- 

.+ ^n-^n = 0 

mod 2 

(7.37) 

versus those satisfying the equation 




a\X\ + (X2X2 + -- 

• + OC n X n = 1 

mod 2 

(7.38) 


Now, if in A ^ 1 A^ 2 - - - An n the sum at is odd, then all treatment combi¬ 
nations with atXi odd, that is, satisfying Eq. (7.38), enter positively into 
A^A^ 2 - - - An n and those with a i x i even, that is, satisfying Eq. (7.37), enter 
negatively into A 0 ^ 1 A^ 2 - - - A ^ n . If on the other hand a/ is even, then all treat¬ 
ment combinations satisfying (7.37) enter positively, and those satisfying (7.38) 
enter negatively. Let 

(A ^ 1 A ^ 2 - - - A^ n )o = [mean response of all treatment 

combinations satisfying E diXi = 0 ] - M 

i 

and 

(A ^ 1 A 0 ^ - - - A^ n )\ = [mean response of all treatment 

combinations satisfying E atXi = 1 ] _ M 


(7.39) 


(7.40) 
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We can then write 

= « A^-.-A^h - (Af A^..-A^) 0 if is odd 

= (A^A^ 2 ••• A^")o- (A^A^ 2 A««)i if J2 a i is even 

or, combining both cases, 

Aj'Af … 々 =(-1)E«, (A «i 々 ...d (A^A^ 2 ■■- A«")i] (7.41) 

7.4.2 Parameterization of Treatment Responses 

We shall use this result now to derive an expression, due to Kempthorne (1952), 
for the parameterization of treatment combination responses in terms of main 
effects and interactions, that is, the generalization of (7.30) for the 2 〃 case. To 
abbreviate the notation, let a! = (a\, ot 2 , ..., a n ), x f = (x\, X 2 ,..., x n ), a(x)= 
= A^A^ 2 Ef = (A a { l A^ 2 ■■- A a n ") t (i = 0, 1). 

First, we establish the following identity: For a fixed x 

a(x) = M + 

CL 

where summation is over all a # (0, 0 ,... ， 0 )，and a!x is reduced modulo 2. 
Expression (7.42) in itself is useful as a parameterization of a{x), and we shall 
return to it in connection with the general factorial experiment (see Chapter 
11). For the 2 n case, however, (7.42) can be rewritten in terms of effects and 
interactions as we shall see. 

Now, to prove (7.42), we make use of (7.39) and (7.40), the definitions of 
and and we observe that 

1. E^, x contains 2 n — l terms. 

2. Each E^, x contains the term —M. 

3 . Each E^, x contains the sum of 2 n ~ l treatment combinations divided by 
2 "-i. 

4. Each E^, x contains a{x). 

5. If y f = (yi, J2,. • • ， Jn)，with y ^ x, satisfies the same equation as x, then 
a\x — j) = 0, and there are exactly 2 n ~ l — 1 distinct solutions a, that is, 
each a(y) ^ a(x) occurs in exactly 2 n ~ l — 1 different E^, x . 

Collecting terms on the right-hand side of (7.42) we then obtain 

从 — （ 2 71 - l)M + ^ n _ x a{x) + — ^ a(j) 

y 拉 

= 2(2"- 1 - l)M + ^zra(x) + 2 ;:_T 1 [2 ,2 M — a(x)] = a(x) 


which proves (7.42). 
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We now use (7.42) to derive an expression of the form 

a{x) = M + x)E CL (7.43) 

a 

where the coefficients (p(a, x) are either or — Let 8f x = 1 if a’x = i and 
= 0 if ct f x ^ i (i = 0, 1). We can then write (7.42) as 

a(x) = M + J2\- 8 o XE o + ^ ，X ^] (7.44) 

a 

From (7.41) we have 

E a = (-l)S «-'(£■« - £«) 

and hence 

£■«-£■« = (7.45) 

Also, from (7.39) and (7.40), we have 

£« + £«= 0 (7.46) 

so that from (7.45) and (7.46) it follows that 

邱 = 4(—l)E<*/£■<* = — ( 士 ) （ _l)^>£ a (7.47) 

Substituting (7.47) into (7.44), we obtain 

a(x) = S^' x (-l )^ a： - 8f x (-l)^ ai E a 

a 

= M+\ - sf x )E a (7.48) 

a 

Now 

S^ x - Sf x = 1 if a’x = 0 

=—1 if a f x = 1 

that is, 

s^ ，x - sf x = (~l) a，x = (- 1 )-^ 

and hence (7.48) becomes 

a(x) = M + \ (7.49) 

a 

With regard to the contribution of E a to a(x), we can state the following rules. 
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Rule 2 The effect or interaction A^ 1 A^ 2 - - - A^ n enters positively into the 
treatment combination response a^a^ 2 - - - if 8 (defined in rule 1) is even and 
it enters negatively if 8 is odd. 

The reader may verify that for n = 3 this rule agrees with the expressions 
given in (7.29) and that (7.49) does indeed agree with (7.30). Also, the reader will 
notice that rule 2 is an exact analog to rule 1. Hence rule 2 can be reformulated 
as rule 2’. 

Rule 2 / If A^ 1 A^ 2 - - - A^ n has an odd (even) number of letters, then it will 
enter positively into a\ l a; 2 ... if both have an odd (even) number of letters 
in common, and it will enter negatively into a^a^ 2 - - - cin n if both have an even 
(odd) number of letters in common. 

This rule will become important in our discussion of systems of confounding 
(see Section 8.6) and fractional replication (see Section 13.3). 

7.4.3 Generalized Interactions 

To conclude this section we shall introduce the concept of a generalized interac¬ 
tion, a concept that also will take on great importance in connection with systems 
of confounding and fractional replication. To motivate this concept we return 
briefly to the 2 2 case and arrange the treatment combinations in the following 
2x2 table: 


B 

bo b\ 


CIO 

aobo 

aob\ 

A 

1 

2 

a\ 

aib 0 

a\b\ 


3 

4 


The contrast defining main effects A, B and the interaction AB can then be 
written down symbolically in terms of the cell labels [T], …， 4 as 

a: 0+0-0-H 
B - 回 + 0-0 -回 
AB - 0 + 0 -回-回 

Returning to the 2 n case, let X and Y denote two interactions, that is, 
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and 

Y = A^A^-'-A^ 

which are defined by the partitions a/JC/ = 0, 1 and = 0, 1, respec¬ 

tively. Now, any pair of equations 

〉: aiX 〖 = k 
i 

i 

with 众 ， € = 0, 1 is satisfied by 2 n ~ 2 treatment combinations. Accordingly, we 
can arrange the 2 n treatment combinations in a 2 x 2 table as follows: 


0 


22Pi x i = 

0 1 



0 

1 



1 



「 


0 

r 


There are three orthogonal comparisons among the four cells, namely 

X.. 回 +0- H - 回 
回 +0—0 一回 


The remaining contrast 


刃 + 0 — 回—回 (7.50) 

corresponds to that of the AB interaction in the 2 2 case above. We shall therefore 
denote the contrast (7.50) by XY and call it the generalized interaction (GI) 
of X and Y. From the table above it follows immediately that the treatment 
combinations in the four cells satisfy the following equations: 


〉 : OiiXi = 0 
i 

^ PiXi = 
i 

= 0 

hence also 

+ Pi)xi = 0 
i 

〉 : = 0 

i 

^2 PiXi = 
i 

=1 

hence also 

E(a,+ + Pi)xi = 1 

i 

〉: OiiXf = 1 
i 

= 

i 

= 0 

hence also 

+ Pi)^i = 1 
i 

〉 : = 1 

^2 PiXi = 

二 1 

hence also 

^ + Pi)xi = 0 
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According to (7.50), the generalized interaction XY can then also be defined in 
terms of the partition 

+ Pi)Xi = 0, 1 mod 2 
i 

This is the reason why we shall write the GI between X = A^ 1 A^ 2 - - - A^ n and 
Y = A 《 2 … An n formally as 

XY = A; 2 " 1 ^ 2 … Ap+A (7.51) 

where the superscripts in (7.51) are reduced modulo 2, letters with superscript 0 
being deleted and letters with superscript 1 being retained following previously 
established rules. 


7.5 INTERPRETATION OF EFFECTS AND INTERACTIONS 


The main effects and interactions are the building blocks of the theory of facto¬ 
rial experiments, a fact that will become more and more obvious in subsequent 
chapters. We have introduced the notion of main effects and interactions in such a 
way that they can be readily interpreted (see Sections 7.2 and 7.3). For example, 
the main effect A is the effect of changing factor A from the 0 level to the 1 
level, averaging over all other factors, or the interaction AB is a measure of the 
extent that the simple effects of A at both levels of B, averaged over all other 
factors, are different from each other. 

In addition to main effects and interactions, however, there may be other 
effects that are of interest to the experimenter. For example, in the presence of 
interaction AB, the simple effects A (bo) and A{b\) may be of interest, or we 
may ask: What is the effect of factor A when factor C is at the 0 level and we 
average over the levels of factor B1 Using the previously established notation, 
we denote this effect by A(b, co), and it is given by 

A(b, co) = \{a\b\co - a 0 biCo + aib 0 c 0 - a 0 boc 0 ) 

=\[ab-b + a- (1)] (7.52) 

Alternatively, (7.52) can be expressed as 

A(b, c 0 ) = A- AC (7.53) 

which follows immediately from the definition of A and AC, namely 

A = i(a-l)(^ + l)(c+l) 

AC = — \){b + l)(c — 1) 
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so that 

A-AC=\{a-\)(b + l) 

which is (7.52). More formally, we write (7.53) as the symbolic product 

A(b, c 0 ) = A(1 - C) 

As another example we may ask: What is the interaction between factors A and 
B if factor C is at the 1 level and we average over the levels of factor D1 The 
answer, of course, is 

AB(c\, d) = ^{a\b\C\do — a\boc\do — a^b\C\do + aoboC\do) 

+ a\b\c\d\ — a\boc\d\ — aob\c\d\ + aoboc\d\ 

=^{abc — ac — be + c + abed — acd — bed + cd) 

=^ (a — \){b — l)c(d + 1 ) 

= AB-\-ABC 

= AB(1 + C) (7.54) 

We shall refer to effects and interactions such as (7.52) and (7.54) as partial 
interactions. For notational purposes in the 2 n case we partition the set N = 
(Ai, A 2 ,..., A n ) of n factors into four disjoint sets: 

•^1 = 5 • • • 5 ^ i ni 

*^*2 = ^71 5 • • • » ^- jn 2 

^3 = 〜， •.. ， Ak n?) 

J<4 = Ai l ,Ai 2 ,...,Ai n4 

with Ni 1 J K2 IJ ^3 U *^4 = ^ + ^2 + ^3 + ^4 = ^ and Ni is always 

nonempty. We then write the partial interaction of all factors in Ki with all 
factors in N 2 at the 0 level, all factors in 3 Nf 3 at the 1 level, and averaging over 
both levels of all factors in X 4 as 

* * * ^in x (^/ 1 ， 0 , ... ， ^ ； „ 2 ,o ； h, ... ， ^„ 3 ,i ； … ，办 ” 4 ) (7.55) 

By definition, (7.55) is given in terms of treatment combinations as 

2 » 1+ 1 4 -i n (a < -!) n ak n ^ + i) (7 . 56) 

~ ie'Ni 
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For purposes of expressing (7.55) in terms of effects and interactions, we 
rewrite (7.56) as 


2«i+« 4 -i n (a, - ” n ” -1)] 

i^i j^2 

]~[ 2 ^ ak + 1) + (你 _ 1)] (仰 + 1) 

k€J < 3 Z 續 4 

Using (7.34) this product can then be written as the symbolic product 

n n (i -〜')n (i +a) (7 . 57) 

In other words: After expansion (7.57) provides a representation of the partial 
interaction (7.55) in terms of effects and interactions. This representation rather 
than (7.56) may prove to be useful if (estimates of) effects and interactions are 
available, but the individual responses are not (see also Section 13.8). 

We illustrate this result by the following example. 


Example 7.1 Let n = 1 , that is, we have factors A\, A2 ,..., A7. Suppose 
= (Ai, A2), N2 = (A3, A4), N3 = (A5), and W4 = Aj). Then 

AiA 2 (« 3 o, a 4 o, as\,a^,ai) = A1A2 — A1A2A3 — A1A2A4 + A1A2A3A4 

+ A1A2A5 — A1A2A3A5 — A1A2A4A5 
+ Aj A2A3A4A5 n 


7.6 ANALYSIS OF FACTORIAL EXPERIMENTS 

So far we have considered main effects and interactions as contrasts among the 
true treatment combination responses. These true responses are, of course, not 
available, but rather estimates of the true responses as the treatment combinations 
are used with a particular error control design. Consequently, we obtain estimates 
of the true effects and interactions. 

7.6.1 Linear Models 

In order to obtain such estimates, we utilize the fact that under additivity the 
observed yields will be given by a general linear model that characterizes the 
error control design that has been used. In its most general form such a model 
can be written as (using the notation of Section 1.4.2) 


j = + XsS + X T r 


(7.58) 
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where y is the vector of observations, 8 is the vector of blocking effects, r is the 
vector of treatment combination effects (by which we mean the true treatment 
combination response expressed as deviation from the true mean response), and 
Xs and X T are known design-model matrices. For example, for the completely 
randomized design (see 1.6) (7.58) is of the form 

y = 1x3 -X r r (7.59) 

For a randomized complete block design (see 1.9) it is of the form 

y = ^3 + X + X T r (7.60) 

or for a Latin square design (see 1.10) it is of the form 

y = + X pp + Xyy + X T r (7.61) 

The effects and interactions, as defined earlier, are then expressible as linear 
contrasts among the treatment combination effects, and we know that under the 
GMNLM (see 1.4.17) the best estimate of such a contrast is the same contrast 
of the estimates of the treatment effects. For models (7.59), (7.60), and (7.61) 
with the same number, r say, of replications for each treatment combination, 
the best estimate of any treatment contrast is given by the same contrast of the 
treatment means, that is, to obtain the estimate of the interaction A ^ 1 A^ 2 - - - A^ n 
we substitute for a(x) = Aj 1 A^ 2 - - - A^ n its observed mean, a(x) say, into (7.34), 
using rule 1 of Section 7.4.1. 

7.6.2 Yates Algorithm 

A convenient way of obtaining those estimates is to apply the Yates algorithm 
(Yates, 1937b) which consists of adding and subtracting the treatment means in 
an appropriate way through several steps. Mainly for historical reasons we shall 
illustrate this algorithm for the 2 3 factorial. The steps are as follows: 

Step 0: Write down the observed treatment means in standard order. (This is 
important since otherwise the algorithm will not work.) 

Step 1: Starting from the top, add adjacent pairs of treatment means and record 
them in the first 4 (= 2 n ~ l ) positions; for the same pairs, subtract the 
first from the second treatment mean and record those differences in 
the next 4 (= 2 n ~ l ) positions. 

Step 2: Repeat step 1 with the numbers recorded in step 1 (column 1). 

Step 3: Repeat step 1 with the numbers obtained in step 2 (column 2); the 
numbers obtained are 8 M, 4A, 4B, ... ， 4ABC, that is, apart from 
a constant the estimates of the mean response and of effects and 
interactions. 
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The three steps in terms of the formal expressions of the treatment means (leaving 
off the bar for convenience) are indicated below: 


Step 


D 

1 

2 

3 

Estimate 

(i) 

ci \ 

(a + \){b 1) 

(a+ 1)(Z? + l)(c+ 1) 

8M 

a 

b(a + 1) 

(a + l)(b + \)c 

{a — \)(b l)(c + 1) 

4A 

b 

ac(a + 1) 

(a — \){b + 1) 

(a + 1)(Z? — l)(c + 1) 

AB 

ab 

bc(a + 1) 

{a — l)(b + l)c 

{ci — \){b — l)(c + 1) 

4AB 

c 

ci _ 1 

(a+ 1)0-1) 

(a + 1)(Z? + l)(c — 1) 

4C 

ac 

b(a — 1) 

(a + \){b — \)c 

{a — 1)(Z?+ l)(c — 1) 

4AC 

be 

ac(a — 1) 

(a - \){b - 1) 

(a + l)(b — l)(c — 1) 

ABC 

abc 

bc(a — 1) 

(a — \){b — \)c 

(a-l)(b-l)(c-l) 

4ABC 


For the 2 n factorial this algorithm requires n steps and at the last step we 
obtain an estimate of 2 n ~ l A°^ 1 A^ 2 - - - A^ n in standard order. 

An inverse algorithm (Good, 1958) can be used to express the treatment com¬ 
bination responses in terms of effects and interactions. The mean, effects, and 
interactions are arranged in reverse standard order, and then we proceed exactly 
as in the Yates algorithm. We shall illustrate this again for the 2 3 factorial: 


Step 


0 


1 

2 

3 

(a - l)(b - l)(c - l) 

= AABC 

2a(b- l)(c- 1 ) 

4ab(c - 1) 

8abc 

{a + 1 )( 办 — l)(c — 1 ) 

= ABC 

2a(b+l)(c- 1 ) 

4ab(c -h 1) 

8bc 

{ci — \){b l)(c — 1 ) 

= 4AC 

2a(b-l)(c-hl) 

4b(c - 1) 

Sac 

(a + 1 )(Z? + l)(c — 1 ) 

= 4C 

2a(b+l)(c+l) 

4b(c-hl) 

8 c 

(a — \){b — l)(c+ 1 ) 

= 4AB 

2(b-l)(c-l) 

4a (c — 1) 

%ab 

{ci + \){b — l)(c+ 1 ) 

= 4B 

2(b+l)(c-l) 

4a(c + 1) 

Sb 

(a — 1 )(^?+ l)(c+ 1 ) 

= 4A 

2(b-l)(c+l) 

4(c - 1) 

Sa 

{ci + 1 )(Z?+ l)(c+ 1 ) 

= 8M 

2(b+l)(c+l) 

4(c+l) 

8 ( 1 ) 


This shows that if we start at step 0 with ^ABC, \ BC ,..., M we obtain 
the treatment combination response averages in reverse standard order. This 
method thus provides an easy way of estimating the residuals. 

A modification of the Yates algorithm has been given by Riedwyl (1998) with 
a view toward applications for systems of confounding (Chapters 8 and 9) and 
fractional factorials (Chapter 13). 
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7.6.3 Variances of Estimators 

Returning now to the estimation of effects and interactions, we know that each 
effect and interaction is estimated as 

2/—1 [( smn of 2” - 1 response means) 

— (sum of 2 n ~ l response means)] 

where each response mean is based on r replications. Since each observation has 
variance and since all the observations can be treated as being uncorrelated 
(see the argument in 1.6.3.4), we obtain 


var ( fa ) = 2 _ 

for all E a = A^A^ 2 - - - A^ n . Since all the effects and interactions are orthogonal 
to each other, so are their estimates, and hence the variance of a partial inter¬ 
action (see Section 7.5) consisting of a linear combination of m interactions is 
mallrT- 1 . 。 

7.6.4 Analysis of Variance 

To estimate the variance (7.62) and that of partial interactions we need to estimate 
This is accomplished through the ANOVA associated with the basic model 
(7.58) as outlined in Chapter 1.4. Such an ANOVA can, of course, be written out 
explicitly for models (7.59), (7.60), and (7.61) according to the general rules for 
such error control designs discussed in Chapters 1.6, 1.9, and 1.10, respectively. 
In any case, MS (Error) is an estimate of o^. 

We shall comment briefly on the sum of squares due to treatments, 
SS(Treatments) say, which in the present situation has t — \ = 2 n — \ d.f. Using 
the fact that effects and interactions represent 2 n — l orthogonal comparisons 
of the treatment effects, we can partition SS (Treatments) into 2 n — l orthogonal 
1-d.f. sums of squares associated with these comparisons (see Chapter 1.7). We 
denote the sum of squares associated with E a = A^A^ 2 - - - by SS(E a ), and 
according to the general rules (see 1.7.2.3) 


using (7.62). 


SS(£ a ) 


(E a f 


var ( E a 


)k 2 


r2 n ~ 


E a 


(7.63) 
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Another way of writing SS ( 五 a )，a generalization of which will be 
later chapters, is as follows. We recall from (7.41) that 

E a = (-l)E a -' (£« - £«) 

In terms of observed quantities we then write 

E a = (-l)S«i (£« _ £«) 

It follows then from (7.63) that 

SS (E a ) = rr_ 2 ( 玲 - £f ) 2 

Using the fact that Eq + = 0 we can rewrite (7.64) as 

SS (E a ) 二 r2 n ~ 2 -E^ + E^ + E^f 

= (£«) 2 
or 

SS (E a ) = r2 n ~ 2 — 玲 — £f ] 2 

=rT (ff ) 2 
or 

SS (E a ) = \r2 n [ 尚 ) 2 + (£f ) 2 

= r 2»->[(£«) 2 + (^) r 

Now 

E [E a f = var (f a ) + [E (E a )f 

= ^e+rr~ 2 [E«] 2 

so that, using (7.63) and (7.66 )， 

E[SS (£ a )] = a] + r2' l ~ 2 [E a f 

Under the GMNLM we can then test 


H 0 : E a =0 

by means of the F test (as an approximation to the randomization test) 

F _ SS(E a ) 

_ MS (Error) 


used in 


(7.64) 


(7.65) 


(7.66) 


(7.67) 
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Table 7.1 ANOVA for 2 3 Factorial in RCBD 


Source 

d.f. 

SS 

E(MS) 

Blocks 

r- 1 

sEu ....) 1 2 


Treatments 

7 

i 

jkt 


A 

1 

2r[A] 2 

(T e 2 + 2r[A] 2 

B 

1 

2r[B] 2 

a ； + 2r[B] 2 

AB 

1 

2r[AB] 2 

(T c 2 + 2r[AB] 2 

C 

1 

2r[C] 2 

cr ； + 2r[C] 2 

AC 

1 

2r[AC] 2 

or, 2 + 2r[AC] 2 

BC 

1 

2r[BCf 

or e 2 + 2r[BC] 2 

ABC 

1 

2r[ABC] 2 

a; + 2r[ABC] 2 

Error 

7(r - 1) 

y^Xyjjki — Yi …— y.ju + 歹 … .) 2 
ijkl 

〜 2 

Total 

8r- 1 

J2 { yuke-y...) 2 

ijU 



As an example of the present discussion we give in Table 7.1 the ANOVA of 
a 2 3 experiment in a RCB design with r blocks, using the model equation 

yijkt = /x + ^/ + Tju + e"u (7.68) 

7.6.5 Numerical Examples 

To illustrate some aspects of the theoretical developments in the previous sections, 
we shall consider two numerical examples. The analysis will be performed by 
using SAS PROC GLM and PROC MIXED (SAS Institute, 1999-2001), and we 
shall provide some comments on the output. 

Example 7.2 Consider a 2 3 factorial in an RCBD with r = 2 blocks. The 
data and the analysis, using the model equation (7.68), are given in Table 7.2. 

Although most of the output is self-explanatory we shall make the following 
comments: 

1. Estimates of the main and interaction effects can be obtained by using the 
ESTIMATE option. The coefficients for the contrasts are obtained by uti¬ 
lizing the relationship between the various parameterizations, as explained 
in Section 7.2.3. 

2. The estimates of the effects are, of course, obtained as linear contrasts 

among the LSMEANS using the coefficients as specified in the ESTIMATE 
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Table 7.2 Data and Analysis for 2 3 Factorial in a RCBD 

options pageno=l nodate ； 
data example1 ； 
input block A B C y @@； 
datalines ； 

1000 13 1100 20 1010 14 1110 17 

1001 21 1101 25 1011 19 1111 22 

2000 24 2100 27 2010 28 2110 31 

2001 35 2101 39 2011 34 2111 40 


run ； 


proc print data=examplel ； 

titlel 'TABLE 7.2'; 

title2 'DATA FOR 2**3 FACTORIAL'; 

title3 'IN A RCBD'; 

run ； 


proc glm data=examplel ； 
class block A B C ； 
model y=block A|B|C ； 

lsmeans ABC A*B A*C B*C A*B*C/stderr ； 

contrast 'Main effect A' A -1 1 ； 

estimate 'Main effect A' A -1 1 ； 

contrast 'Interaction AB' A*B .5 - .5 - .5 .5; 

estimate 'Interaction AB' A*B .5 - .5 - .5 .5; 

contrast 'Interaction ABC' A*B*C -.25 .25 .25 - .25 .25 -.25 -.25 .25; 

estimate 'Interaction ABC' A*B*C -.25 .25 .25 - .25 .25 -.25 -.25 .25; 

titlel 'ANALYSIS OF 2**3 FACTORIAL'; 
title2 'IN A RCBD'; 
run ； 


Obs block ABC y 


1 1 

2 1 

3 1 

4 1 

5 1 

6 1 

7 1 

8 1 

9 2 

10 2 

11 2 

12 2 

13 2 

14 2 


0 0 0 13 
1 0 0 20 
0 1 0 14 
1 1 0 17 
0 0 1 21 
1 0 1 25 
0 1 1 19 
1 1 1 22 
0 0 0 24 
1 0 0 27 
0 1 0 28 
1 1 0 31 
0 0 1 35 
1 0 1 39 
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Table 7.2 (Continued) 


2 0 

1 1 

34 

2 1 

1 1 

40 

ANALYSIS OF 2**3 FACTORIAL 

IN A RCBD 

The GLM Procedure 

Class 

Level Information 

Class 

Levels 

Values 

block 

2 

12 

A 

2 

0 1 

B 

2 

0 1 

C 

2 

0 1 

Number of observations 

The GLM Procedure 

16 


Dependent Variable : y 


Sum of 


Source 

DF 

Squares 

Mean Square 

F Value 

Pr > F 

Model 

8 

1026.000000 

128.250000 

24.98 

0.0002 

Error 

7 

35.937500 

5.133929 



Corrected 

Total 15 

1061.937500 





R-Square 

Coeff Var 

Root MSE 

y Mean 



0.966159 

8.863833 

2.265817 

25.56250 


Source 

DF 

Type I SS 

Mean Square 

F Value 

Pr > F 

block 

1 715.5625000 

715.5625000 

139.38 

<.0001 

A 

1 

68.0625000 

68.0625000 

13.26 

0.0083 

B 

1 

0.0625000 

0.0625000 

0.01 

0.9152 

A*B 

1 

0.5625000 

0.5625000 

0.11 

0.7503 

C 

1 232.5625000 

232.5625000 

45.30 

0.0003 

A*C 

1 

0.0625000 

0.0625000 

0.01 

0.9152 

B*C 

1 

7.5625000 

7.5625000 

1.47 

0.2642 

A*B*C 

1 

1.5625000 

1.5625000 

0.30 

0.5983 
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Table 7.2 {Continued) 


Source 

DF 

Type III SS 

Mean Square 

F Value 

Pr > F 

block 

1 

715.5625000 

715.5625000 

139.38 

< . 0001 

A 

1 

68.0625000 

68.0625000 

13.26 

0.0083 

B 

1 

0.0625000 

0.0625000 

0.01 

0.9152 

A*B 

1 

0.5625000 

0.5625000 

0.11 

0.7503 

C 

1 

232.5625000 

232.5625000 

45.30 

0.0003 

A*C 

1 

0.0625000 

0.0625000 

0.01 

0.9152 

B*C 

1 

7.5625000 

7.5625000 

1.47 

0.2642 

A*B*C 

1 

1.5625000 

1.5625000 

0.30 

0.5983 





Least Squares Means 






Standard 



A 


y LSMEAN 

Error 

Pr > |t| 


0 


23.5000000 

0.8010874 

<.0001 


1 


27.6250000 

0.8010874 

<.0001 





Standard 



B 


y LSMEAN 

Error 

Pr > |t| 


0 


25.5000000 

0.8010874 

<.0001 


1 


25.6250000 

0.8010874 

< . 0001 





Standard 



C 


y LSMEAN 

Error 

Pr > |t| 


0 


21.7500000 

0.8010874 

<.0001 


1 


29.3750000 

0.8010874 

< . 0001 





Standard 


A 


B 

y LSMEAN 

Error 

Pr > |t| 

0 


0 

23.2500000 

1.1329087 

< . 0001 

0 


1 

23.7500000 

1.1329087 

< . 0001 

1 


0 

27.7500000 

1.1329087 

< . 0001 

1 


1 

27.5000000 

1.1329087 

< . 0001 





Standard 


A 


c 

y LSMEAN 

Error 

Pr > |t| 

0 


0 

19.7500000 

1.1329087 

< . 0001 

0 


1 

27.2500000 

1.1329087 

< • 0001 

1 


0 

23.7500000 

1.1329087 

< . 0001 

1 


1 

31.5000000 

1.1329087 

< . 0001 
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Table 7.2 (Continued) 








Standard 




B 

C 



y LSMEAN 

Error 

Pr > |t| 



0 

0 



21.0000000 

1.1329087 

< . 0001 



0 

1 



30.0000000 

1.1329087 

< . 0001 



1 

0 



22.5000000 

1.1329087 

< . 0001 



1 

1 



28.7500000 

1.1329087 

< . 0001 








Standard 



A 


B 


c 

y LSMEAN 

Error 

Pr > |t| 


0 


0 


0 

18.5000000 

1.6021749 

<.0001 


0 


0 


1 

28.0000000 

1.6021749 

<.0001 


0 


1 


0 

21.0000000 

1.6021749 

<.0001 


0 


1 


1 

26.5000000 

1.6021749 

<.0001 


1 


0 


0 

23.5000000 

1.6021749 

<.0001 


1 


0 


1 

32.0000000 

1.6021749 

<.0001 


1 


1 


0 

24.0000000 

1.6021749 

<.0001 


1 


1 


1 

31.0000000 

1.6021749 

<.0001 


Dependent Variable : 

y 






Contrast 


DF 


Contrast SS 

Mean Square 

F Value 

Pr > F 

Main effect 

A 


1 


68.06250000 

68.06250000 

13.26 

0.0083 

Interaction 

AB 


1 


0.56250000 

0.56250000 

0.11 

0.7503 

Interaction 

ABC 


1 


1.56250000 

1.56250000 

0.30 

0.5983 





Standard 



Parameter 


Estimate 

Error 

t Value 

Pr > |t| 

Main effect 

A 

4.12500000 

1.13290871 

3.64 

0.0083 

Interaction 

AB 

-0.37500000 

1.13290871 

-0.33 

0.7503 

Interaction 

ABC 

0.62500000 

1.13290871 

0.55 

0.5983 


statements. Recall that for an orthogonal design the least-squares means 
are just the ordinary means. Thus, we have, for example, 

AB = ^[LSMEAN(^o^o) - LSMEAN(a 0 Z?i) 

- LSMEAN(aiZ? 0 ) + LSMEAN(ai^i)] 

= \ (y.oo- — y oi- ~ y-io- +y.ii.) 

= ^(23.25 - 23.75 - 27.75 + 27.5) 

=-.375 (7.69) 
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3. The E? terms, as introduced in Section 7.4.2, can easily be obtained from 
the LSMEANS. For example, 

g 110 ) = 士 [LSMEAN(ao^o) + LSMEAN(a 山 )] 

— overall mean 

=\ (y.oo- + y.ii.) _ 歹 … 

=\ (23.25 + 27.50) - 25.5625 
=-.1875 
and 

£} 110) = .1875 

4. From the estimate in comment 2 and (7.63)we obtain 

SS(AB) = 2.2. (—.375 ) 2 = .5625 

The same value is, of course, obtained in the ANOVA table and through 
the CONTRAST statement. 

5. The standard error (SE) for the estimate of an effect is provided through the 
ESTIMATE statement. It can also be obtained directly from the information 
provided for the LSMEANS. For example, making use of (7.69), we obtain 

var(A5) = ^ -4var [LSMEAN(a 0 ^o)] 

=(1.133) 2 = 1.283 

and hence 

SE(AB) = yvar(AB) = 1.133 

The same result can be obtained by substituting in (7.62) the estimate for 
namely = MS (Error) = 5.134 from the ANOVA table, 


SE(AB) = 5.134 = 1.133 □ 


Example 7.3 The following is an example of combining all three elements 
of an experimental design, namely error control design, treatment design, and 
sampling design (see 1.2.23 and 1.3). More specifically, we consider a 2 3 factorial 
in an RCBD with 〆 = 2 blocks and subsampling (n = 2). The data and the 
analysis are given in Table 7.3. 



ANALYSIS OF FACTORIAL EXPERIMENTS 


273 


Table 7.3 Data and Analysis for 2 3 Factorial in a RCBD With Subsampling 

options pageno=l nodate ； 
data example2 ； 
input block A B C y @@； 
datalines ； 

1000 13 1000 14 1100 20 1100 22 

1010 14 1010 11 1110 17 1110 17 

1001 21 1001 22 1101 25 1101 23 

1011 19 1011 20 1111 22 1111 23 

2000 24 2000 26 2100 27 2100 26 

2010 28 2010 27 2110 31 2110 32 

2001 35 2001 37 2101 39 2101 39 

2011 34 2011 35 2111 40 2111 42 


run ； 

proc print data=example2 ； 

titlel 'TABLE 7.3'; 

title2 'DATA FOR 2**3 FACTORIAL'; 

title3 'IN A RCBD WITH SUBSAMPLING'; 

run ； 

proc glm data=example2 ； 

class block A B C ； 

model y=block A|B|C block*A*B*C ； 

test H=A B C A*B A*C B*C A*B*C E=block*A*B*C; 

titlel 'ANALYSIS OF 2**3 FACTORIAL'; 

title2 'IN A RCBD WITH SUBSAMPLING'; 

run ； 

proc mixed data=example2 ； 

class block A B C ； 

model y=block A|B|C ； 

random block*A*B*C ； 

estimate 'Main effect A' A -1 1 ； 

estimate 'Interaction AB 1 A*B .5 -.5 - .5 .5 ； 

estimate 'Interaction ABC' A*B*C -.25 .25 .25 -.25 .25 -.25 - .25 .25; 
run ； 


Obs block ABC y 


1 

2 

3 

4 

5 

6 
7 


0 0 0 13 
0 0 0 14 
1 0 0 20 
1 0 0 22 
0 1 0 14 
0 1 0 11 
1 1 0 17 
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Table 7.2 


Dependent 

Source 

Model 


{Continued) 


8 1 

9 1 

10 1 

11 1 

12 1 

13 1 

14 1 

15 1 

16 1 

17 2 

18 2 

19 2 

20 2 

21 2 

22 2 

23 2 

24 2 

25 2 

26 2 

27 2 

28 2 

29 2 

30 2 

31 2 

32 2 


1 1 
0 0 
0 0 
1 0 
1 0 
0 1 
0 1 
1 1 
1 1 
0 0 
0 0 
1 0 
1 0 
0 1 
0 1 
1 1 
1 1 
0 0 
0 0 
1 0 
1 0 
0 1 
0 1 
1 1 
1 1 


0 17 

1 21 

1 22 

1 25 

1 23 

1 19 

1 20 

1 22 

1 23 

0 24 

0 26 

0 27 

0 26 

0 28 

0 27 

0 31 

0 32 

1 35 

1 37 

1 39 

1 39 

1 34 

1 35 

1 40 

1 42 


ANALYSIS OF 2**3 FACTORIAL 
IN A RCBD WITH SUBSAMPLING 

The GLM Procedure 

Class Level Information 


Class 

Levels 

Val 

block 

2 

12 

A 

2 

0 1 

B 

2 

0 1 

C 

2 

0 1 


Number of observations 32 
Variable : y 

Sum of 

DF Squares Mean Square F Value Pr > F 

15 2244.968750 149.664583 129.44 <.0001 
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Table 7.3 (Continued) 


Error 16 18.500000 1.156250 

Corrected Total 31 2263.468750 


R-Square 

Coeff Var 

Root MSE 

y Mean 

0.991827 

4.170824 

1.075291 

25.78125 


Source 

DF 

Type I SS 

Mean Square 

F Value 

Pr > F 

block 

1 

1498.781250 

1498.781250 

1296.24 

<.0001 

A 

1 

132.031250 

132.031250 

114.19 

<.0001 

B 

1 

0.031250 

0.031250 

0.03 

0.8715 

A*B 

1 

1.531250 

1.531250 

1.32 

0.2667 

C 

1 

504.031250 

504.031250 

435.92 

<.0001 

A*C 

1 

0.781250 

0.781250 

0.68 

0.4232 

B*C 

1 

3.781250 

3.781250 

3.27 

0.0894 

A*B*C 

1 

2.531250 

2.531250 

2.19 

0.1584 

block*A*B*C 

7 

101.468750 

14.495536 

12.54 

<.0001 


Source 

DF 

Type III SS 

Mean Square 

F Value 

Pr > F 

block 

1 

1498.781250 

1498.781250 

1296.24 

<.0001 

A 

1 

132.031250 

132.031250 

114.19 

<.0001 

B 

1 

0.031250 

0.031250 

0.03 

0.8715 

A*B 

1 

1.531250 

1.531250 

1.32 

0.2667 

C 

1 

504.031250 

504.031250 

435.92 

<.0001 

A*C 

1 

0.781250 

0.781250 

0.68 

0.4232 

B*C 

1 

3.781250 

3.781250 

3.27 

0.0894 

A*B*C 

1 

2.531250 

2.531250 

2.19 

0.1584 

block*A*B*C 

7 

101.468750 

14.495536 

12.54 

<.0001 


Tests of 

Hypotheses Using 

the Type III 

MS for block*A*B*C 

as an Error Term 

Source 

DF 

Type III SS 

Mean Square 

F Value 

Pr > F 

A 

1 

132.0312500 

132.0312500 

9.11 

0.0194 

B 

1 

0.0312500 

0.0312500 

0.00 

0.9643 

C 

1 

504.0312500 

504.0312500 

34.77 

0.0006 

A*B 

1 

1.5312500 

1.5312500 

0.11 

0.7547 

A*C 

1 

0.7812500 

0.7812500 

0.05 

0.8231 

B*C 

1 

3.7812500 

3.7812500 

0.26 

0.6252 

A*B*C 

1 

2.5312500 

2.5312500 

0.17 

0.6886 
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Table 7.3 {Continued) 


The Mixed Procedure 
Model Information 


Data Set 

Dependent Variable 
Covariance Structure 
Estimation Method 
Residual Variance Method 
Fixed Effects SE Method 
Degrees of Freedom Method 


W0RK.EXAMPLE2 

y 

Variance Components 

REML 

Profile 

Model-Based 

Containment 


Class Level Information 


Class 

Levels 

Values 

block 

2 

1 

2 


A 

2 

0 

1 


B 

2 

0 

1 


C 

2 

0 

1 



Dimensions 




Covariance Parameters 



2 

Columns 

in X 



29 

Columns 

in Z 



16 

Subjects 



1 

Max Obs 

Per Subject 



32 

Observations Used 



32 


Observations Not Used 0 

Total Observations 32 


Iteration History 


Iteration 


Evaluations 


-2 Res Log Like 


Criterion 


0 1 116.43092171 

1 1 99.48076616 0.00000000 


Convergence criteria met. 

Covariance Parameter 
Estimates 


Cov Parm 


Estimate 


block*A*B*C 

Residual 


6.6696 
1.1563 
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Table 7.3 (Continued) 


Fit Statistics 


-2 Res Log Likelihood 99.5 
AIC (smaller is better) 103.5 
AICC (smaller is better) 104.1 
BIC (smaller is better) 105.0 


Type 3 Tests of Fixed Effects 



Num 

Den 



Effect 

DF 

DF 

F Value 

Pr > F 

block 

1 

7 

103.40 

< . 0001 

A 

1 

7 

9.11 

0.0194 

B 

1 

7 

0.00 

0.9643 

A*B 

1 

7 

0.11 

0.7547 

C 

1 

7 

34.77 

0.0006 

A*C 

1 

7 

0.05 

0.8231 

B*C 

1 

7 

0.26 

0.6252 

A*B*C 

1 

7 

0.17 

0.6886 


Label 


Estimate 

Estimates 

Standard 

Error 

DF 

t Value 

Pr > |t| 

Main effect 

A 

4.0625 

1.3461 

7 

3.02 

0.0194 

Interaction 

AB 

0.4375 

1.3461 

7 

0.33 

0.7547 

Interaction 

ABC 

0.5625 

1.3461 

7 

0.42 

0.6886 


The analysis is based on the model 

yijkim = M ^jki H - jki H - ^lijktm 


(see 1.6.9.1). 

The following comments are intended to relate the output to the theoretical 
developments: 

1. SAS PROC GLM is being used only to obtain an ANOVA table. Because 
of subsampling it is necessary to explicitly define the experimental error 
as block * A * B * C in order to perform the correct tests about all effects, 
using MS(X)/MS(block * A * B * C), where X represents any of the seven 
effects and MS (Experimental error) = MS (block * A * 召 * C) represents 
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technically speaking, the 7 d.f. associated with the seven 1-d.f. interactions 
block * X. 

2. Specifying the correct error term is not necessary when using SAS PROC 
MIXED because the random term block * A * B * C automatically assumes 
that role. This can be verified by comparing the tests performed by PROC 
GLM and PROC MIXED (note, however, that the test about block effects 
should be ignored; see 1.9.2.6). 

3. PROC MIXED provides REML estimates of c f 2 and (PROC GLM 
provides an estimate of only). We obtain 

of = 6.6696 = 1.1563 

4. In estimating the various effects, and their standard errors as illustrated 
here for A, AB, and ABC, by means of the ESTIMATE statement, PROC 
MIXED uses the correct error term, namely MS (Experimental error). 

5. The above statement is true also for the LSMEANS, and their standard 

error. □ 


7.6.6 Use of Only One Replicate 

In the previous section we have assumed that each treatment combination is repli¬ 
cated r times in a certain error control design such as a CRD or a RCBD. In any 
case we usually have r > 1, according to the general principles of experimental 
design. One can visualize, however, that in a 2 n factorial experiment (particularly 
an exploratory-type experiment) the number of treatment combinations can be 
rather large, in fact so large that the experimenter cannot afford to replicate the 
treatments. How should we deal with such a situation? 

Even with only one replicate, that is, r = 1, we can, of course, estimate all 
effects and interactions in the usual way. We cannot, however, estimate their 
variances as there are no d.f. left for error in the ANOVA. We then recall (see 
Chapter 1.11) that many-factor, that is, higher order, interactions are often negli¬ 
gible. Assuming that they are actually zero, it follows then from (7.67) that the 
expected value of their sums of squares is o^. The idea then is to pool all sums 
of squares associated with interactions assumed (a priori) to be negligible and 
treat the resulting sum of squares as SS(Error) for analysis purposes. Even if the 
interactions are not zero (and they seldom are), this procedure will result in a 
conservative F test for the remaining effects and interactions. 



CHAPTER 8 


Confounding in 2 n Factorial Designs 


8.1 INTRODUCTION 

It is well known that the experimental error variance is related to the size of the 
block, increasing to a greater or lesser degree as the block size increases. If we 
were testing 5 factors each at 2 levels, we would have 32 treatment combinations 
and hence, if we were to use a randomized complete block design, we would 
require blocks of size 32 in order to compare them. In field experiments it is 
generally acknowledged that, if at all possible, the size of the block should 
not be greater than 16 and if possible 8, though, of course, there are no hard 
and fast rules, and one experiment with blocks of 16 EUs may well give a 
lower experimental error than another on similar material with blocks of 8 EUs. 
In other situations, as, for example, in industrial experimentation, it may be 
impossible to have blocks as large as this without randomizing over factors that 
contribute considerable variation, thereby causing large experimental error. Or in 
experiments with animals (e.g. mice), the litter size determines and hence limits 
the block size. It is desirable, therefore, to have some means of reducing the block 
size or of making use of blocks of smaller size (i.e., incomplete blocks), and for 
this purpose the device of confounding has been introduced. It is a fundamental 
concept in the theory and application of factorial experiments. 

8.1.1 A Simple Example 

We have introduced the concept of confounding certain interactions (or effects, 
in general) with block effects in Section 1.11.5. The basic idea is that information 
about unimportant, that is, usually negligible interactions is sacrificed in the sense 
that only biased estimates of the confounded interactions are available, and the 
bias is a linear function (contrast) of block effects. For example, for the 2 3 
factorial (with factors A, B, C) with blocks of size 4, we might confound the 
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3-factor interaction, ABC, with blocks that lead to the arrangement 

Block 1 (1), ab, ac, be 

Block 2 a, b, c, abc 


( 8 . 1 ) 


Since for each observation we have derived a linear model of the form 


Observation = mean + block effect + treatment effect + error (8.2) 


(see Sections 1.9.2 and 1.3) and since block 1 contains all the treatment combi¬ 
nations that enter negatively into ABC (see Section 7.3) and block 2 contains 
those entering positively, we have 

E(ABC) = ABC Pi) (8.3) 


where 0i, 02 are the block effects, in other words, the interaction ABC and the 
difference of block effects, — fiu cannot be separated. We say, for short, that 
ABC is confounded with blocks. 


8.1.2 Comparison of Information 


As we have mentioned earlier (see Section 1.11.2) the main value of the facto¬ 
rial structure lies in the fact that in many situations the high-order interactions 
are of inappreciable magnitude. If in the situation described above the 3-factor 
interaction can be assumed to be negligible, then the basic arrangement (8.1) is 
quite satisfactory since it allows the unbiased estimation of all main effects and 
2-factor interactions. For practical purposes, however, one would have to repeat 
the basic arrangement (8.1)，say r times (using, of course, a different randomiza¬ 
tion every time). Thus, with r replications for each treatment combination, we 
can compare the information on each effect and interaction obtained from the 
present design with that obtained from a design without confounding, that is, the 
RCBD. Using the reciprocal of the true variance of each estimator as a measure 
of the information on each effect and interaction, and denoting by the true 
error variance [of the error in model (8.2)] with blocks of size k, we have the 
results of Table 8.1. Since generally we have increased the information 

on main effects and 2-factor interactions in the ratio cr^/cr^ by using blocks of 
size 4 at the expense of obtaining zero information on the 3-factor interaction 
instead of 2r/a| units. 

8.1.3 Basic Analysis 

Before discussing the problem of confounding and appropriate systems of con¬ 
founding more generally and more formally, we shall conclude this introductory 
section by outlining briefly the analysis of the experiment discussed above. As 
indicated earlier, all main effects and 2-factor interactions are estimated in the 
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Table 8.1 Comparison of Information 


Effect or 

Information with Blocks of 

Interaction 

8 Units 

4 Units 

A 



B 



AB 


2r 

C 

. 2r 


AC 

BC 

a i 

ABC 


Zero 


usual way based upon the average responses from the r replicates. We can, for 

example, use the Yates algorithm and obtain the estimates A, B, AB, C, AC, BC. 

- - 

We also obtain ABC, but we know already that it is an estimate of ABC plus 
a function of block effect differences. In order to distinguish it from unbiased 
estimates we shall denote it by ABC rather than ABC. 

For purposes of writing out the analysis of variance we rewrite model (8.2) 
more explicitly as 


yijkim = M + P/y + ^kim H - jktm (^-4) 

where ^ is the effect of the jth block in the ith replicate (i = 1, 2,..., r; j = 
1,2), rum is the effect of the treatment combination akbic m {k, m = 0, 1), and 
^ijUm is the usual error, that is, experimental and observational error. 

We then have the usual breakdown of the total sum of squares into the block 
sum of squares, the treatment sum of squares, and the error sum of squares as 
exhibited in Table 8.2. The treatment sum of squares is partitioned in the usual 
way (as explained in Chapter 7), except that there is no SSabc. 

We close this section with a few remarks about some of the features of 
Table 8.2: 

1. Due to the confounding of ABC we have only 6 d.f. for treatments, which 
means that this incomplete block design is a disconnected design (see 
Section 1.5). It is, of course, disconnected by choice. 

2. The sums of squares for estimable (and unconfounded) main effects and 
interactions are obtained as if an RCBD had been used, that is, by linear 
combinations of treatment means. This is contrary to what would have 
happened had we assigned the treatment combinations in other ways to 
existing incomplete block designs, for example, BIB or PBIB designs. In 
that case the method described in Section 1.3 would have to be used, that 
is, treatment means would have to be replaced by least-squares means. 

3. The statement above implies that the estimable effects are orthogonal to 
block effects; that is, we have, for example, that SS(A|Blocks) = SS(A). 
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Table 8.2 Analysis of Variance Table for 2 3 Factorial in Blocks of Size 4 


Source 


Sum 

Expected 

of Variation 

d.f. 

of Squares 

Mean Squares 

Blocks 

2r - 1 

iy V 2 . 

4 乙少 "… 8r 

ij 


Replicates 

r- 1 

i 2 y 2 -- 

8 


ABC 

1 

2 r[ABC ] 2 


Remainder 

r- 1 

Difference 


Treatments 

6 



A 

1 

2 r[A ] 2 

〜 2 + 2 r[A ] 2 

B 

1 

2 r[B ] 2 

〜 2 + 2 r[B ] 2 

AB 

1 

2 r[AB ] 2 

cr^ + 2 r[AB ] 2 

C 

1 

2 r[C ] 2 

cr^ + 2 r[C ] 2 

AC 

1 

2 r[AC ] 2 

〜 2 + 2 r[AC ] 2 

BC 

1 

2 r[BC ] 2 

〜 2 + 2 r[BC ] 2 

Error 

6(r - 1) 

Difference 


Total 

8r - 1 

J 2 yfjktm - 




i jklm 



4. Although it is not important at this point but will become important later 
(see Section 8.7), SS(Blocks) can be partitioned in various ways. Recall 
that we have r replicates, each consisting of two blocks. We therefore have 

SS (Blocks) = SS (Replicates) + SS (Blocks/Replicates) 


with r — 1 and r d.f., respectively. Now, the makeup of the two blocks 
in each replicate is the same, that is, one block contains the treatment 
combinations (1) ， ab ， ac, be and the other block contains a, b, c, abc. 
Apart from block effects, the comparison between the two blocks is an 
“estimate” of the (confounded) interaction ABC. Hence we can write 


SS (Blocks/Replicates) = SS(ABC) + SS (Remainder) 


where SS(ABC) is obtained in the usual way (see remark 2 above). 

5. Although we indicate that SS (Error) and SS (Remainder) can be obtained 
by subtraction, there is also a direct way of computing them. Each main 
effect and 2-factor interaction can be estimated (unbiasedly) from the r 
replicates and comparisons among the r estimates of a particular effect 
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provide a measure of the interaction between replicates and the effect. 
These interactions are, at least for our current discussion, assumed to be 
negligible. Hence, these comparisons “belong to error” (see 1.4.17.1) and 
the corresponding sums of squares, for example, SS(A x reps), are part of 
SS(Error). Each such “interaction” sum of squares has r — 1 d.f. Since we 
have six unconfounded effects we have six such sums of squares and hence 
6(r — 1) d.f. for SS (Error). The same comment applies to SS (Remainder), 
which, technically speaking, is equal to SS(ABC x reps). For a more pre¬ 
cise discussion see Section 8.5.3. 


8.2 SYSTEMS OF CONFOUNDING 


In this section we present a formal method of constructing systems of con¬ 
founding when the block sizes are equal to a power of 2. Such systems will be 
obtained by considering different partitions of the factor space as represented by 
corresponding systems of equations. 

8.2.1 Blocks of Size 2"- 1 

The general feature of the example discussed in the previous section is that the 
block size is ^ the number of treatment combinations. As a consequence two 
blocks are necessary to accommodate all possible treatment combinations, and 
hence not all treatment contrasts are estimable. Through proper arrangement of 
the treatment combinations in the blocks, it can be achieved that the nonestimable 
contrast (i.e.，the lost degree of freedom) is of minor or no importance. 

For the 2 n factorial with blocks of size k = 2 n ~ l this means that we can 
confound any interaction A°^ 1 A^ 2 - - - An n with blocks, and the basic arrangement 
is obtained by assigning the set of treatment combinations satisfying 


a\x\ + a ， 2 X 2 + -- 

• + ~ 0 

mod 2 

to one block and the set satisfying 



a\X\ + (X2X2 + -- 

• + = 1 

mod 2 


to the other block. Typically, one would choose the interaction with all oti = 
1(/ = 1 , 2 ,..., n), that is, the highest-order interaction. 

8.2.2 Blocks of Size 2 n ~ 2 

In many practical situations blocks of size 2 n ~ l may still be too large or may 
not be available. From a factorial point of view the next smaller block size is 
then k = 2 n ~ 2 . In this case four blocks are needed for the 2 n treatment combi¬ 
nations. It is obvious then that the 3 d.f. due to block comparisons are “lost” 
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as far as treatment comparisons are concerned. With a proper allocation of the 
treatment combinations to the blocks this means that three interactions will not 
be estimable, that is, will have to be confounded with blocks. To obtain an appro¬ 
priate arrangement, we assume first that the interaction E a = A^ 1 A^ 2 - - - is 
negligible, that is, can be confounded with blocks. Based on the equations 

a\x\ + a 2 X 2 + - h oi n x n = 0 mod 2 (8.5) 

a\x\ + a ， 2 X 2 + - h oi n x n = 1 mod 2 (8.6) 

we divide the 2 n treatment combinations into two sets of 2 n ~ l treatment combi¬ 
nations each. We now assume further that the interaction A^ 2 - - - A^ n 

is also negligible. We can then subdivide the set of treatment combinations into 
two sets of 2 n_1 based on the equations 

P\x\ + P 2 X 2 + ... + PnX n = 0 mod 2 (8.7) 

P\X\ + + .. • + p n x n = 1 mod 2 (8.8) 

Because of orthogonality, we know that among the 2 卜 1 treatment combinations 
that satisfy (8.5), exactly 2 n — 2 satisfy also (8.7) and the remaining 2 n ~ 2 satisfy 
(8.8). The same statement can be made for the treatment combinations satisfying 
(8.6). This then gives us a method of dividing the treatment combinations into 4 
sets of 2 n ~ 2 treatment combinations, which are then assigned to the four blocks, 
that is, we consider 

Block 1: ^ a，iXi =0 

i 

=0 

i 

Block 2: ^ atXi = 0 

i 

= 1 

i 

Block 3: ^ aiXi = 1 

i 

=0 

i 

Block 4: ^ aiXi = 1 

i 

= 1 
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where all equations are reduced modulo 2 as usual. In terms of block comparisons 
E a and are defined (apart from a constant) as follows: 





Block 



1 

2 3 4 


■- A a n " 

+ 

+ -- 

W 2 .. 

.於 

+ 

-+ - 


The third orthogonal comparison is 



1 

Block 

2 3 

4 


. A a n ,,+ti ' 1 

+ 

— — 

+ 


which we recognize immediately as the GI of E a and satisfying the equations 


+ Pi)xi = 0 
i 

and 

+ Pi)xi = 1 


The block contrasts given above also indicate at the same time with which 
linear function of block effects each of the three interactions is confounded. 

The main result of this discussion, however, is that if two interactions E a 
and are each confounded with blocks then also their GI is confounded 
with blocks. This imposes a certain restriction on the choice of the design, that 
is, the allocation of the treatment combinations to the blocks. It may very well 
be the case that, although E 01 and are negligible, is not; in fact it may 

be a main effect or a low-order interaction. Hence care has to be exercised in 
choosing E 01 and . 


8.2.3 General Case 

The problem described above becomes even more acute if we have to resort 
to still smaller block sizes. Suppose then that we have available blocks of size 
k = 2 p (p < n) that require 2 n _ p blocks for the basic arrangement and hence the 
confounding of 2 n ~ p — 1 interactions with blocks. To find a suitable system of 
confounding, we choose n — p interactions E as represented by the n-tuples 

^ s = (%1 ， a s 2, . . . ， ^-sn) (8.9) 

with s = 1, 2,..., w — p, such that no a f s can be obtained as a linear combina¬ 
tion of the remaining 行 -tuples. We refer to this set of interactions as independent 
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interactions . Each set of equations associated with these n — p interactions, 
namely 

a\\x\ -\-a\2x2 + -\-a\ n x n = 5 i 

^ 21^1 + ^ 22^2 + • • • + 0 L 2 n X n = 82 

: ( 8 . 10 ) 
以《 —/ 7 ， 1*^1 + ^n-p,2-^2 + . . . + a n _p ， n X n = S n —p 

for a fixed right-hand side with 8 S = 0, l(s = 1,2, n — p) is satisfied by 
2 P treatment combinations (x\,X2, .. •, x n ). Since each 8 S can take on two dif¬ 
ferent values, there are altogether 2 n ~ p equations of the form (8.10) (with the 
a f s , s = 1, 2,..., n — p, remaining constant, of course). Any treatment combi¬ 
nation satisfying one set of equations cannot satisfy another set. Hence the 
2 n ~ p sets of 2 P treatment combinations comprise together all 2 n treatment com¬ 
binations and, moreover, each set makes up the treatment combinations for 
one block. 

Because of the nature of the equations (8.10)，the interactions E as are obvi¬ 
ously confounded with blocks. We have shown earlier that if two interactions 
are confounded with blocks then so is their GI. Similarly if three interactions, 
E Ul , E 012 , E 013 say, are confounded with blocks, then any treatment combination 
(x\,X2 ,..., x n ) that satisfies the equations 

〉 : ^li^i = $1 

i 

0i2iXi = 82 
i 

^2 a3iXi = 83 
i 

also satisfies the equations 

+ d2i)Xi =8\-\-82 
i 

+ ^ 3 /)^- = 83 

i 

^^21 + Ot 3 i)Xi =82 + 83 


+ 0i2i + Ot^i)Xi = 51+52 + ^3 
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which define the GIs £ ai+a2 , _E ai+a3 , _E a2+a3 ，and £' ai+a2+a3 , respectively, 
which are then also confounded with blocks. By induction it is then easy to 
see that ii n — p interactions are confounded with blocks so are their 



GIs. It follows then from (8.11) that altogether 2 n ~ p — 1 interactions (including 
the set ofn — p independent ones) are confounded with blocks, which is equal 
to the number of linearly independent comparisons among the 2 n ~ p blocks. 

This result emphasizes the important role played by the original set of n — p 
independent interactions. They must be chosen so that their GIs do not result in 
main effects and/or low-order interactions that are possibly nonnegligible. 

We shall illustrate this procedure with the following example. 

Example 8.1 Suppose we have six factors A, B, C, D, E, F and we consider 
blocks of size 8, that is, we have /i = 6, 2 6 = 64 treatment combinations, 2 P = 
8, n — p = 3, 2 n ~ p = 8. A possible set of three independent interactions is 

ABCD, CDE, BDF 


Their GIs are 

(ABCD){CDE) = ABC 2 D 2 E = ABE 
(ABCD)(BDF) = AB 2 CD 2 F = ACF 
(CDE 、 (BDF ) 二 BCD 2 EF = BCEF 
(ABCD)(CDE) = AB 2 C 2 D 3 EF = ADEF 

Hence the seven interactions ABCD, CDE, BDF, ABE, ACF, BCEF, and 
ADEF will be confounded with blocks. □ 


We give in Table 8.3 typical examples of systems of confounding for up to 8 
factors. Only those systems are given that result in a block size of 16 or less. The 
allocation of letters to the factors is a matter for the experimenter, and the type 
exemplified by ABC*, ABD*, CD for 4 factors in blocks of 4 experimental units 
includes 5 additional systems, namely, AB*, ACD*, BCD .， AC*, ABD*, BCD .， 
AD*, ABC*, BCD; BC* ， ABD'ACD; BD*,ABC*, ACD. If the experi¬ 
menter wishes to use this type of confounding, he should decide which of the six 
2-factor interactions is least important and choose the system involving that one. 



288 


CONFOUNDING IN 2 n FACTORIAL DESIGNS 


Table 8.3 T^pes of Confounding for 2 n Factorial Systems in Blocks of Size k 

n k Confounded Effects^ 


2 

3 


4 


7 


2 Any one effect or interaction 

4 Any one effect or interaction 

2 AB*,AC*,BC 

A*, BC'ABC 

8 Any one effect or interaction 

4 A*, BCD*, ABCD 

AB*,CD*,ABCD 
AB*,ACD*,BCD 

2 A*, B* ， AB, C*, AC, BC, ABC 

A*, BC* ， ABC, BD' ABD, CD, ACD 
AB*, AC*, BC, AD*, BD, CD, ABCD 

16 Any one effect or interaction 

8 A*,BCDE*,ABCDE 

AB*,CDE*,ABCDE 
ABC*,CDE*,ABDE 

4 A*, B*, AB, CDE*, ACDE, BCDE, ABCDE 

A*, BC' ABC, DE*, ADE, BCDE, ABCDE 
A*, BC* ， ABC, CDE' ACDE, BDE, ABDE 
AB*, AC*, BC, DE\ ABDE, ACDE, BCDE 
AB*, CD*, ABCD, BDE*, ADE, BCE, ACE 
2 AB*, AC*, AD*, AE*, and all GIs 

A*, BC*, AD*, BE *，and all GIs 

16 A5CD*, CDEF*, ABEF 

8 AC£*, BDE' ABCD, BCF*, ABEF, CDEF, ADF 

4 AB*, CD*, ABCD, EF*, ABEF, CDEF, ABCDEF 
ACE^, BCE*, ADE, BDE*, ACF, BCF, ADF, BDF 
2 AB*, AC*, AD*, AE' AF*, and all GIs 

[not particularly useful except with partial 
confounding (see Chapter 9)] 

A*, BC*, BD* ， BE*, BF*, BG *，and all GIs 

16 ACEG*, BDE' ABCDG, BCF*, ABEFG, CDEF, ADFG 
8 ABC*, ADE*, BCDE, BDF*, ACDF, ABEF, CEF, ABCDEFG* 

DEFG, BCFG, AFG, ACEG, BEG, CDG, ABDG 
4 A5*, AC*, AD*, AE' AF*, AG*, and all GIs 

[again not particularly useful] 

A*, BC*, BD*, BE' 5G*, and all GIs 

16 As for 7 factors in blocks of 8 

ABC*, ADE^, AFG*, BDG' CH' and all their GIs; 

CH is the only 2-factor interaction confounded. 


Effects with an asterisk (*) are the independent effects of (8.10). 
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In general one would like to avoid confounding of main effects and 2-factor 
interactions. There is a remarkable theorem obtained by Fisher (1942) that this 
is possible only if the number of units in a block is larger than the number of 
factors. We shall return to this theorem in Section 11.7. 


8.3 COMPOSITION OF BLOCKS FORA PARTICULAR SYSTEM 
OF CONFOUNDING 


Once it has been decided which effects and interactions are to be confounded 
with blocks, that is, once the set of independent interactions has been chosen, the 
composition of the individual blocks is (apart from randomization) completely 
determined. In fact, varying the right-hand side of the system of equations (8.10) 
in all possible ways yields the sets of treatment combinations that are to be 
allocated to the various blocks. We shall now describe a more systematic way of 
determining these sets. 


8.3.1 Intrablock Subgroup 

We consider the general case of a 2 n factorial in blocks of size 2 P . Suppose we 
confound the interactions E Us (s = 1,2,..., n — p) with a s defined in (8.9). Out 
of the 2 n _ p possible sets of equations (8.10) we then consider the set 


仪 11^1 

+ ^12^2 


.+ 

^\n^n = 0 


Qf21^i 

+ a22X2 


.+ 

^2n^n = ^ 

( 8 . 12 ) 


^n—p,2^2 


.+ 

^■n—p,n^n = ^ 



As pointed out earlier, Eqs. (8.12) are satisfied by 2 P treatment combinations 
x f j = (xj\,Xj 2 , .. ., Xj n )(j = 1,2,..., 2 P ). We shall now exhibit the structure 
of this set, which will prove to be useful for the construction of all blocks. 

We note that (i) the treatment combination 0’ = (0, 0, . • • ， 0)，often referred 
to as the control, satisfies (8.12); and (ii) if two treatment combinations, Xk and 
xt say, satisfy (8.12), then the treatment combination 

x k~^ x i = ( x kl + + Xt2, ...,X kn -\- Xtn) (8.13) 

also satisfies (8.12), where each component in (8.13) is reduced mod 2. Hence, if 
we define “addition” of treatment combinations by (8.13), then the 2 P treatment 
combinations satisfying (8.12) form an Abelian group with respect to this addition 
operation and with as the zero element. More precisely, it forms a subgroup 
of the group of all 2 n treatment combinations. In the context of systems of 
confounding this subgroup is referred to as the intrablock subgroup (IBSG). 
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In terms of the reduced representation of treatment combinations, that is, (1), 
a ， b ， ab, c,..the operation for this subgroup is the ordinary algebraic multipli¬ 
cation with the square of any letter replaced by unity. The symbol (1) acts as the 
unity element of the group. 

If we examine Eq. (8.12)，we see immediately that associated with the nonzero 
a s i for each s = 1 , 2, ..., w — p there can be only an even number of Xi(i = 
1,2，...，"）equal to unity. This implies that each treatment combination in the 
IBSG has an even number of letters in common with each confounded interaction. 
Because of the group property of the IBSG, one only needs to find, in addition 
to the control, p “independent” treatment combinations that satisfy (8.12). The 
remaining ones are then determined automatically. 

We shall illustrate this procedure by continuing Example 8.1. 


Example 8.1 (Continued) Equations (8.12) are of the form 
X\ X2 Xo, X4 = 0 

X3 + X4 + X5 =0 

X 2 + X 4 +X6 =0 . 

The following treatment combinations satisfy these equations; that is, consti¬ 
tute the IBSG: 


1 . ( 0 , 0 , 0 , 0 , 0 , 0 ) 
2 . ( 1 , 1 , 1 , 1 , 0 , 0 ) 

3. (1,0, 1,0, 1,0) 

4. (0,1,0, 1,1,0) 

5. (1,0,0, 1 ， 1,1) 

6. (0,1, 1,0, 1,1) 

7. (0,0, 1 ， 1,0,1) 

8 . ( 1 , 1 , 0 , 0 , 0 , 1 ) 


(by adding 2 and 3) 

(by adding 2 and 5) 
(by adding 3 and 5) 
(by adding 4 and 5) 


In terms of the reduced representation these treatment combinations are (1), 
abed, ace, bde(= abed x ace), adef ， bcef(= abed x adef), cdf(= ace x 
adef), abf(= bde x adef). 

8.3.2 Remaining Blocks 

So far we have described the composition of one block. Rather than change the 
right-hand side of (8.10) and solve the resulting equations to obtain the remaining 
2 n ~ p — 1 blocks, we make use of the IBSG and construct the blocks in the 
following way. Suppose the treatment combinations in the IBSG are denoted by 

Xy = (X j\, ^jn) (j = 1 ， 2 ， . . • ， 2 户） 
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and suppose further that y f = (ji, , y n ) is a treatment combination not in 

the IBSG. Then the treatment combinations 

{x f j + y ； j = l,2,...,2P} (8.14) 

form a new set of 2 P treatment combinations. Obviously, these treatment combi¬ 
nations satisfy some set of equations (8.10) with ( 8 \, 82 ,, S n ) ^ ( 0 , 0 ,... , 0 ) 
and hence are different from the treatment combinations in the IBSG. Thus these 
treatment combinations form another block. 

This process is now repeated, replacing y f in (8.14) by a treatment combination 
z’，where z! is neither in the IBSG nor in a previously constructed set (block). 
This gives an easy and systematic way of constructing all blocks. We illustrate 
this again with a continuation of Example 8.1. 

Example 8.1 (Continued) Let block 1 be the IBSG. The first treatment com¬ 
bination of each subsequent block takes the role of y f in (8.14). The eight blocks 
then are given in Table 8.4. 

In terms of the reduced representation for a treatment combination the method 
just described means that the treatment combinations in any one block may be 
obtained from those in the IBSG by multiplying the symbols of the treatment 
combinations in the IBSG by the symbol of a treatment combination not contained 
in any previous block, replacing the square of any letter by unity. 

Also, as a consequence of the procedure, we have that when an effect or 
interaction is confounded with blocks, all treatment combinations in a block 
have either an even number of letters in common with the effect or interaction 
or an odd number. No two treatment combinations in the same block can have 
an odd number of letters and the other an even number of letters in common with 
any effect or interaction that is confounded. This serves as a convenient check. 

8.4 DETECTING A SYSTEM OF CONFOUNDING 

Occasionally we may be confronted with the problem of analyzing an experiment 
that we have not designed. It may then be necessary to discover the system of 
confounding on which the plan for the experiment is based. This can be done by 
reversing the procedure outlined in the previous section. 

To illustrate, consider the plan given for Example 8.1. Take the IBSG, that is, 
the block containing the control. We then know that each treatment combination in 
that block must have an even number of letters in common with each confounded 
interaction. Inspection shows that no 2-factor interaction meets that requirement; 
hence no 2-factor interaction can be confounded with blocks. Next going through 
the 3-factor interactions in a systematic fashion one finds that ABE and ACF meet 
the requirement. Hence they and their GI, BCEF, are confounded with blocks. 
Finally, BDF is found to be confounded with blocks, and so then are all GIs of 
BDF with the previously identified interactions. Thus the system of confounding 
has been determined completely. 
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Table 8.4 Basic Plan for 2 6 Factorial in Blocks of Size 8 


Block 1 

Block 2 

Block 3 

(0, 0, 0, 0, 0, 0) 

(1, 0, 0, 0, 0, 0) 

(0, 1, 0, 0, 0, 0) 

(1, 1, 1, 1, 0, 0) 

(0, 1, 1, 1, 0, 0) 

(1, o, 1, 1, 0, 0) 

(1, 0, 1, 0, 1, 0) 

(0, 0, 1, 0, 1, 0) 

(1,1,1, o, 1, 0) 

(0, 1, 0, 1, 1, 0) 

(1,1,0,1, 1,0) 

(0, 0, 0, 1, 1, 0) 

(1, 0, 0, 1, 1, 1) 

(0, 0, 0, 1, 1, 1) 

(1, 1,0, 1, 1, 1) 

(0, 1, 1, 0, 1 ， 1 ) 

(1 ，1 , 1 ，0 , 1 ， 1 ) 

(0, 0, 1, 0, 1, 1) 

(0, 0, 1, 1 ，0 , 1) 

(1,0, 1 ， 1 , 0, 1) 

(0, 1, 1, 1, 0, 1) 

(1,1,0, 0, 0, 1) 

(0, 1, 0, 0, 0, 1) 

(1, 0, 0, 0, 0,1) 

Block 4 

Block 5 

Block 6 

(0, 0, 1 ，0 , 0, 0) 

(0, 0, 0, 1, 0, 0) 

(0, 0, 0, 0, 1, 0) 

(1 ，1 , 0, 1, 0, 0) 

(1,1,1,0,0, 0) 

(1,1,1,1,1,0) 

(1 ， 0 , 0, 0, 1 ， 0 ) 

(1,0, 1, 1, 1,0) 

(1, 0, 1, 0, 0, 0) 

(0, 1, 1, 1 ， 1 ， 0) 

(0, 1, 0, 0, 1, 0) 

(0, 1, 0, 1, 0, 0) 

(1 ，0 , 1, 1 ，1， 1 ) 

(1,0,0, 0, 1, 1) 

(1,0,0, 1, 0, 1) 

(0, 1, 0, 0, 1 ， 1 ) 

(0, 1, 1 ， 1 ， 1 ， 1) 

(0, 1, 1 ， 0 , 0, 1) 

(0, 0, 0, 1, 0, 1) 

(0, 0, 1 ， 0 , 0, 1) 

(0, 0, 1 ， 1 ， 1 ， 1) 

(1, 1 ， 1 ,0,0, 1) 

(1,1,0, 1, 0, 1) 

(1 ，1，0 ,0, 1 ， 1 ) 

Block 7 

Block 8 


(0, 0, 0, 0, 0, 1) 

(1, 1, 1, 1, 1, 1) 


(1, 1 ， 1 , 1, 0, 1) 

(0, 0, 0, 0, 1, 1) 


(1, o, 1, 0, 1, 1) 

(0, 1, 0, 1, 0, 1) 


(0, 1, 0, 1, 1 ， 1 ) 

(1, o, 1, 0, 0, 1) 


(1 ， 0 , 0, 1, 1 ， 0 ) 

(0, 1, 1, 0, 0, 0) 


(0, 1, 1 ，0 , 1 ， 0 ) 

(1, 0, 0, 1, 0, 0) 


(0, 0, 1, 1 ， 0 , 0) 

(1,1, o, 0, 1, 0) 


(1, 1, 0, 0, 0, 0) 

(0, 0, 1, 1, 1, 0) 



This then exemplifies the general procedure that should be used in a case like 
this: Start with main effects, then 2-factor interactions, and so forth and find 
n — p independent interactions that satisfy the even-or-odd-number-of-letters-in- 
common requirement for a given block. This presupposes, however, that a proper 
system of confounding has been used except that one does not know which. If 
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one is not sure, one should check other blocks to see whether they too conform 
to the even-or-odd-numbers-of-letters-in-common requirement. 


8.5 USING SAS FOR CONSTRUCTING SYSTEMS 
OF CONFOUNDING 


In this section we shall illustrate briefly how the SAS procedure PROC FACTEX 
can be used to generate systems of confounding. 


Example 8.2 In Table 8.5 we present the SAS input and output for the 2 3 
factorial in blocks of size 4, assuming that the 3-factor interaction is negligible. 
The keys to generating the system are (1) specification of the block size and 
(2) specification of the effects to be estimated, in this example A, B, C, AB, 
AC, BC. The “Block Pseudo-factor Confounding Rules” give then the effect(s) 
confounded with blocks, in this example ABC. □ 


Example 8.3 This example is to illustrate how, in the 2 3 factorial with blocks 
of size 4, any effect can be confounded with blocks. For example, we may know 
already something about the A effect from a previous experiment, but we are 
mainly interested in the other main effects and all interactions. Table 8.6 gives 
such a plan. □ 

Example 8.4 Consider the 2 4 factorial in blocks of size 4. It is intended to 
estimate all the main effects and as many 2-factor interactions as possible, and 
then as many 3-factor interactions as possible. From Table 8.3 we see that design 
3 iox n = A,k = A seems an appropriate choice, since it sacrifices information 
on one 2-factor interaction, AB ，and two 3-factor interactions, ACD and BCD. 
This knowledge determines the model statement in Table 8.7 and yields the 
basic arrangement given in Table 8.7 The two independent interactions to be 
confounded with blocks are listed as BCD and ACD, which implies, of course, 
that the generalized interaction, AB, is also confounded with blocks. □ 


8.6 ANALYSIS OF EXPERIMENTS WITH CONFOUNDING 


8.6.1 Estimation of Effects and Interactions 

We consider the 2 n experiment with blocks of size 2 P ， replicating the basic block 
arrangement r times. A linear model appropriate for this design is of the form 
(7.60), which we write more specifically as 


yij(x) = iu, + Pij + r(x) + eij(x) (8.15) 


or, equivalently, 


yij(x) = /X + p/ + + r ⑻ + eij(x) 


(8.16) 
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Table 8.5 2 3 Factorial in Blocks of Size 4 (Confounding ABC) 


options nodate pageno=l ； 

proc factex ； 

factors ABC; 

size design=8 ； 

blocks size=4 ； 

model estimate=(A IBIC @2); 

examine design confounding ； 

output out=design blockname=block nvals=(1 2); 
titlel 'TABLE 8.5'; 

title2 '2**3 FACTORIAL IN BLOCKS OF SIZE 4'; 

title3 'CONFOUNDING ABC'; 

run ； 

proc print data=design ； 
run ； 


The FACTEX Procedure 


Design Points 


Experiment 

Number ABC Block 


1 

2 

3 

4 

5 

6 

7 

8 



1 

1 

1 

1 



1 

1 


1 

1 



1 

2 

2 

1 

2 

1 

1 

2 


Block Pseudofactor Confounding Rules 
[Bl] = A*B*C 


Obs 


block 


A 


B 


C 
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Table 8.6 2 3 Factorial in Blocks of Size 4 (Confounding Main Effect A) 

options nodate pageno=l ； 
proc factex ； 
factors ABC; 
blocks size=4 ； 

model est=(B A*B C A*C B*C A*B*C); 
examine design confounding ； 

output out=design blockname=block nvals=(1 2); 
titlel 'TABLE 8.6'; 

title2 '2**3 FACTORIAL IN BLOCKS OF SIZE 4'; 

title3 'CONFOUNDING MAIN EFFECT A 1 ; 

run ； 

proc print data=design; 
run ； 



The 

FACTEX 

Design 

Procedure 

Points 


Experiment 

Number 

A 

B 

C 

Block 

1 

-1 

-1 

-1 

1 

2 

-1 

-1 

1 

1 

3 

-1 

1 

-1 

1 

4 

-1 

1 

1 

1 

5 

1 

-1 

-1 

2 

6 

1 

-1 

1 

2 

7 

1 

1 

-1 

2 

8 

1 

1 

1 

2 


Block Pseudofactor Confounding Rules 
[Bl] = A 

Obs block ABC 
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Table 8.7 2 1 2 3 4 * * * * 9 10 11 12 13 14 15 16 Factorial in Blocks of Size 4 

options nodate pageno=l ； 
proc factex ； 
factors A B C D ； 

model est=(A B C A*C B*C D A*D B*D C*D A*B*C A*B*D); 

blocks size=4 ； 

examine design confounding ； 

output out=design blockname=block nvals=(1 2 3 4); 
titlel 'TABLE 8.7'; 

title2 '2**4 FACTORIAL IN BLOCKS OF SIZE 4'; 
run ； 

proc print data=design ； 
run ； 


The FACTEX Procedure 


Design Points 


Experiment 

Number A B C D Block 


1 -1 -1 

2 -1 -1 

3 -1 -1 

4 -1 -1 

5 - 11 

6 - 11 

7 - 11 

8 - 11 

9 1-1 

10 1 -1 

11 1 -1 

12 1 -1 

13 1 1 

14 1 1 

15 1 1 

16 1 1 



Block Pseudofactor Confounding Rules 

[Bl] = B*C*D 
[B2] = A*C*D 


1 


1 

4 

4 

1 

2 

3 

3 

2 

3 

2 

2 

3 

4 
1 
1 
4 
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Table 8.7 (Continued) 


Obs 

block 

A 

B 

C 

D 

1 

1 

-1 

-1 

-1 

-1 

2 

1 

-1 

-1 

1 

1 

3 

1 

1 

1 

-1 

1 

4 

1 

1 

1 

1 

-1 

5 

2 

-1 

1 

-1 

-1 

6 

2 

-1 

1 

1 

1 

7 

2 

1 

-1 

-1 

1 

8 

2 

1 

-1 

1 

-1 

9 

3 

-1 

1 

-1 

1 

10 

3 

-1 

1 

1 

-1 

11 

3 

1 

-1 

-1 

-1 

12 

3 

1 

-1 

1 

1 

13 

4 

-1 

-1 

-1 

1 

14 

4 

-1 

-1 

1 

-1 

15 

4 

1 

1 

-1 

-1 

16 

4 

1 

1 

1 

1 


where pi 

咬 or ~ 

r(x) 

eij(x) 


=effect of the /th replicate (/ = 1 ， 2, ... ， r) 

=effect of jth block in /th replicate (j = 1 , 2 ,..., 2 n ~ p ) 
= effect of treatment combination x' = (xi, X2,..., x n ) 
(xi = 0 , 1 ; € = l, 2, ..., n) 

=error associated with treatment combination x in the 
jth block of the ith replicate, 


or in matrix notation 

j = /x3+X p /> + X〆 + X r r+e (8.17) 

with the obvious definition of all terms in this model equation. Let £\ = {E ai , i = 
1 , 2, … ， q(= 2 n ~ p — 1)} be the set of interactions confounded with blocks, and 
let £2 = {E ym , m = 1,2,, s[= 2 n ~ p (2 p — 1)]} be the set of interactions not 
confounded with blocks. (We note that we use here the word interaction in a 
general sense that includes also main effects as 1-factor interactions.) We then 
obtain in the usual way, 〜 using the Yates algorithm for example, estimates E^ m 
for each E Vm e £2 and E ai for each E at g £ 1 , such that £ 〜 is BLUE and E ai 
is a biased estimate. From the general discussion in Chapter 7 it also follows that 

var(f〜）= -^i°e (8.18) 


8.6.2 Parameterization of Treatment Responses 

It is clear from the discussion above that only the E Ym g £2 are estimable. 
It is also obvious from the construction of a confounded design that certain 
treatment combinations never occur together in the same block and hence cannot, 
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ordinarily, be compared with each other. We thus have a disconnected design, 
which generally is an undesirable situation. In the factorial setting this negative 
feature may be overcome, however. If we can assume that all e £1 are 
negligible, then the reparameterization (7.42) and (7.43) may be rewritten as 

a(x) = M + ^2 E y r m x 
y m 
or 

a{x) = M + ^ / <p(y m ， x)E y « 
y m 


respectively, where is summation over all y m such that E ym e S 2 . For any 
two treatment combinations x and z, we can then obtain 

«(x) - fl(z) = ^ [0(y m , x) - 0 (y m , z)] E Ym 
y m 


where 0(y m , x) — 0(y m , z) is either 0, 1, or —1. Using (8.18) we can then 
evaluate var \a(x) — a'(z)]. 

As an illustration of this procedure consider the following example. 


Example 8.5 Suppose we have a 2 3 experiment in blocks of size 4, con¬ 
founding ABC. We have the basic arrangement 

Block 1: (1), ab, ac, be 

Block 2: a,b, c, abc 

which is replicated r times. Suppose a(x) = abc, a(z) = (1). Then, assuming 
ABC to be negligible, we can write 


and 


and hence 


and 


^bc = M + \A + \B + + l；C + bM ： + l^BC 


2^ I 2 1 


(T) = M- \a - + ^ab - ic+ \ac+ \bc 


2 ^ 1 2 2 


abc — (1) = A + B + C 


var [abc — ( 1 ) 


rl n ~ 


□ 


8.6.3 ANOVA Tables 

The ANOVA table for a system of confounding as described in this section 
follows the basic outline of the ANOVA table for incomplete block designs (see 
Table 1.3) and is as given in Table 8.8a. A partitioning of the various sums of 
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Table 8.8 ANOVA Tables for System of Confounding 



Eh 1 r2 n ~ 2 [ 私 ] 2 

/|1 X^,X T (r- 1 ) 2 n ~P {IP - 1 ) = (r - 1 > Subtraction 
T— r2"-l 

i j x 
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squares is then given in Table 8.8b. Although quite self-explanatory we shall 
comment on these partitionings briefly. 

The partitioning 


SS(X^\3) = S S (Replicates) + SS(Blocks/reps) 
follows immediately from model (8.16) and can be written more precisely as 

SS(Z^ I 3) = SS(X P I 3) + SS(X r 1 3, X p ) (8.19) 

Since each treatment combination occurs exactly once in each replicate, 
SS(Z p |3) is orthogonal to treatments in the same sense that blocks and treatments 
are orthogonal to each other in a randomized complete block design. However, 
SS(Xp*\0, X p ) is not orthogonal to treatments. This fact becomes evident from 
the partitioning 


q 



SS(X/s* 1 0, Xp) = SS(E ac ) + SS(Remainder) 

(8.20) 

where 

1=1 


with 

SS(E at ) = r2 n _ 2 [E ai f 

(8.21) 


ESS(E at ) = r2 n ~ 2 [E at f + Q t {^) + a] 

(8.22) 


and Qi(Pfj) a quadratic function of block effects corresponding to the block 
contrast defining E 011 in the chosen system of confounding. 

There is another way of writing (8.20) that is informative. Let be the 
estimate of E ai in the zth replicate (/ = 1, 2,..., r) and let 

SS(E^) = 2 • 卜 1 [E^ l f (8.23) 

be the sum of squares associated with that estimate. It is clear then that 


r q 

ss(x r m) = e e ss(s 广 ) 


q 

= SS(E ai ) 



^ V r 

+E E SS(£ ; a€ ) - SS(E ae ) 

£= 1 _ i=1 

Hence, in (8.20) 

^ r r — 

SS (Remainder)= EE SS(£f € ) - SS(E a ” 


(8.24) 


(8.25) 
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The purpose of writing SS(X^* | 3, X p ) in the form (8.24) is to partition it 
into what is sometimes referred to as a treatment component and a pure block 
component. Obviously, SS(E a£ ) depends both on treatment effects, namely 
the E a£ , and on block effects. As we shall show below SS(Remainder) does not 
depend on treatment effects, hence h SS ( 五 a£ ) is the treatment component of 
the block sum of squares mentioned above. 

We now consider SS (Remainder) and derive (8.25) in a different way, which 
will show that SS (Remainder) is algebraically equivalent to a sum of squares 
associated with the replicate x E at (t = 1,2,... ,q) interactions. Let 


= {average observed response in replicate i 
of all treatment combinations satisfying 
ct^x = 5 (5 = 0 , 1 )} 

We note that is the mean of 2 n ~ l observations and that — 3^.0. 

We then consider the following two-way table: 

Replicate (/) 



1 

2 . 

.. r 


0 

5 = 

A 01 

Y 0i£ 

^02 • 

.• A 0r 


1 

A 11 

A 12 

v oie 

.• A lr 




A 2 • 

• . -/V .y 

x ai 


The interaction sum of squares in this table (on a per-observation basis) is 


ss ( 仆 =2"- 1 e e 

i 8 

i 8 


since = yj(-) and = 歹 The expression above for SS(I)i shows that 
it is free of treatment effects as those cancel within each square bracket. Further, 


ss ( 仆 = 2, EE[^]+ r E[^]-2EE 聆印 

l i 8 8 i 8 


= 2 ° _1 EE[^]- r E[^] 2 (8.26) 

l i 8 8 J 

=V (£p) — SS (E ae ) 
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using (7.65). Hence 

q 

SS(I)i = SS (Remainder) 

£=i 

as given in (8.25). 

Turning now to SS(Z T | 3, the treatment sum of squares adjusted for block 
effects with s = 2 n ~ p (2 p — 1) d.f” we can partition it into 5 single-d.f. sums 
of squares associated with the unconfounded interactions E ym (m = 1,2,, s), 
given by 

SS(E r "') = r2 n ~ 2 [E y -f (8.27) 

We then use the F test (as an approximation to the randomization test) 

尸一 SS (則 

MS (Residual) 

to test the hypothesis Ho ： E ym =0. 

Finally, corresponding to SS (Remainder) in (8.25) we can partition SS (Error) 
as s sums of squares with r — 1 d.f. each, that is, 

s r r ~ 

SS(Error)= EE SS(£f") —SS(£^) I (8.28) 

m=l L/=l _ 

where 

SS (Ej m ) = 2 n ~ 2 [E^ m f 

and Ej m is the BLUE of E Ym in the ith replicate. Obviously, then, (8.28) is a 
function of the e"(x) only and 

r ~ 

E J2 SS (d — SS (E y '") = (r - l)a e 2 (8.29) 

_i = l _ 

for m = 1,2,... ,s. Hence we have s independent estimates of cr^, a fact that 
may be useful for testing homogeneity of error. 

To complete this discussion about the various sums of squares in Table 8.8b, 
we mention that just as 

SS(I)i = - SS(E at ) 

i 

in (8.25) is algebraically the (replicate x E ae ) interaction sum of squares with 
r — 1 d.f. (£ = 1,2, , q), 

SS(/) m =^SS(£f m ) - SS(Eh) 
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in (8.28) is algebraically the (replicate xE ym ) interaction sum of squares with 
r — 1 d.f. (m = 1， 2, ... ， s). 

8.7 INTERBLOCK INFORMATION IN CONFOUNDED 
EXPERIMENTS 

In discussing the problem of confounding and the analysis of confounded exper¬ 
iments we have assumed that the block effects are fixed effects, as is evident 
from Eq. (8.3)，for example. If, however, we consider block effects as random 
effects, then it becomes clear that, again referring to (8.3), 

長五 [block 2 — block 1] = ABC 

that is, the block difference is an unbiased estimate of ABC. The only difference 
between this estimator, ABC say, and the estimators for the other effects and 
interactions, A,..., BC is that with r replications 

var(A) = ••- = var(BC) = — aj 

2r 

whereas 

var(ABC) = {a] + 4cr^) 

From the nature of the estimators it is clear that A,..., BC are intrablock esti¬ 
mates and that ABC is an interblock estimate as it is a linear function of block 
totals only (see Section 1.7). 

For the general case of a 2 n factorial in blocks of size k = 2 P ， using an 
appropriate system of confounding, we are then able to estimate all interactions as 


Table 8.9 ANOVA for Interblock Analysis 


Source 

d.f. 

SS 

Replicates 

r- 1 


E ai 

1 

+2Pa^ + r2 n ~ 2 [E a '] 2 

E 012 

1 

cr^+2Pa^ + r2 n - 2 [E a2 ] 2 

E a q 

1 

a}+2Paj + r2 n ~ 2 [E a i} 2 

Remainder 

(r- \)(2 n -P - 1) 

°e + 2Pa j 


(8.30) 

(8.31) 
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Table 8.10 Data and Intrablock Analysis for 2 3 Factorial in Blocks of Size 4 With 
ABC Confounded with Blocks 


options nodate pageno=l ； 
data example ； 

input rep block A B C y @@； 
datalines ； 

11000 14 11011 25 

11101 17 11110 20 

12001 13 12010 16 

12100 14 12111 26 

23000 24 23011 34 

23101 27 23110 29 

24001 24 24010 26 

24100 25 24111 35 


run ； 

proc print data=example ； 
titlel 'TABLE 8.10'; 

title2 'DATA FOR 2**3 FACTORIAL IN BLOCKS OF SIZE 4'; 

title3 'WITH ABC CONFOUNDED WITH BLOCKS'; 

run ； 

proc glm data=example ； 
class rep block ABC; 
model y=rep block(rep) A|B|C@2 ； 



estimate 'A*B' A*B 1 -1 -1 l/divisor=2 ； 

title2 1 INTRA-BLOCK ANALYSIS OF 2**3 FACTORIAL'; 

title3 'IN BLOCKS OF SIZE 4'; 

title4 'WITH ABC CONFOUNDED WITH BLOCKS'; 

run ； 


Obs 

rep 

block 

A 

B 

c 

y 

1 

1 

1 

0 

0 

0 

14 

2 

1 

1 

0 

1 

1 

25 

3 

1 

1 

1 

0 

1 

17 

4 

1 

1 

1 

1 

0 

20 

5 

1 

2 

0 

0 

1 

13 

6 

1 

2 

0 

1 

0 

16 

7 

1 

2 

1 

0 

0 

14 

8 

1 

2 

1 

1 

1 

26 

9 

2 

3 

0 

0 

0 

24 

10 

2 

3 

0 

1 

1 

34 

11 

2 

3 

1 

0 

1 

27 

12 

2 

3 

1 

1 

0 

29 

13 

2 

4 

0 

0 

1 

24 

14 

2 

4 

0 

1 

0 

26 

15 

2 

4 

1 

0 

0 

25 

16 

2 

4 

1 

1 

1 

35 
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Table 8.10 (Continued) 

INTRABLOCK ANALYSIS OF 2**3 FACTORIAL 
IN BLOCKS OF SIZE 4 
WITH ABC CONFOUNDED WITH BLOCKS 

The GLM Procedure 

Class Level Information 

Class Levels Values 
rep 2 12 

block 4 1234 

A 2 0 1 

B 2 0 1 

C 2 0 1 

Number of observations 16 

Dependent Variable : y 

Source DF Sum of Squares Mean Square F Value Pr > F 

Model 9 699.0625000 77.6736111 248.56 <.0001 

Error 6 1.8750000 0.3125000 

Corrected Total 15 700.9375000 


Source 

R- 

0 . 

DF 

Square Coeff Var Root MSE 

997325 2.423922 0.559017 

Type I SS Mean Square 

y Mean 

23.06250 

F Value 

Pr > F 

rep 

1 

390.0625000 

390.0625000 

1248.20 

<.0001 

block(rep) 

2 

8.1250000 

4.0625000 

13.00 

0.0066 

A 

1 

18.0625000 

18.0625000 

57.80 

0.0003 

B 

1 

175.5625000 

175.5625000 

561.80 

< . 0001 

A*B 

1 

0.0625000 

0.0625000 

0.20 

0.6704 

C 

A*C 

1 

68.0625000 

68.0625000 

217.80 

<.0001 

1 

0.0625000 

0.0625000 

0.20 

0.6704 

B*C 

1 

39.0625000 

39.0625000 

125.00 

<.0001 

Source 

DF 

Type III SS 

Mean Square 

F Value 

Pr > F 

rep 

1 

390.0625000 

390.0625000 

1248.20 

<.0001 

block(rep) 

2 

8.1250000 

4.0625000 

13.00 

0.0066 

A 

1 

18.0625000 

18.0625000 

57.80 

0.0003 

B 

1 

175.5625000 

175.5625000 

561.80 

<.0001 

A*B 

1 

0.0625000 

0.0625000 

0.20 

0.6704 

C 

A*C 

1 

68.0625000 

68.0625000 

217.80 

<.0001 

1 

0.0625000 

0.0625000 

0.20 

0.6704 

B*C 

1 

39.0625000 

39.0625000 

125.00 

<.0001 

Parameter 


Estimate 

Standard 

Error t 

Value Pr 

> Itl 

A 


2.12500000 

0.27950850 

7.60 0 

. 0003 

A*B 


0.12500000 

0.27950850 

0.45 0 

. 6704 
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Table 8.11 Data and Combined Analysis for 2 3 Factorial in Blocks of Size 4 With 
ABC Confounded with Blocks 


options nodate pageno=l ； 
data example ； 

input rep block A B C y @@; 
datalines ； 


1 1 0 0 0 14 
1 1 1 0 1 17 
1 2 0 0 1 13 

1 2 1 0 0 14 

2 3 0 0 0 24 
2 3 1 0 1 27 
2 4 0 0 1 24 
2 4 1 0 0 25 


1 1 0 1 1 25 
1 1 1 1 0 20 
1 2 0 1 0 16 
1 2 1 1 1 26 

2 3 0 1 1 34 
2 3 1 1 0 29 
2 4 0 1 0 26 
2 4 1 1 1 35 


run ； 

proc print data=example ； 
titlel 'TABLE 8.11'; 

title2 'DATA FOR 2**3 FACTORIAL IN BLOCKS OF SIZE 4'; 
title3 'WITH ABC CONFOUNDED WITH BLOCKS 1 ; 


proc mixed data=example ； 

class rep block A B C ； 

model y=rep A|B|C/ddfm=satterth ； 

random block(rep )； 

estimate 1 A 1 A -1 1 ； 

estimate 1 A*B' A*B 1 -1 -1 l/divisor=2 ； 

estimate 1 A*B*C 1 A*B*C -111-11-1 -1 l/divisor=4 ； 

titlel 'TABLE 8.11'; 

title2 'INTRA- AND INTER-BLOCK ANALYSIS 1 ; 
title3 'OF 2**3 FACTORIAL IN BLOCKS OF SIZE 4'; 
title4 'WITH ABC CONFOUNDED WITH BLOCKS'; 
run ； 

Obs rep block A B C y 
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Table 8.11 (Continued) 


INTRA- AND INTERBLOCK ANALYSIS 
OF 2**3 FACTORIAL IN BLOCKS OF SIZE 4 
WITH ABC CONFOUNDED WITH BLOCKS 


The Mixed Procedure 
Model 


Data Set 

Dependent Variable 
Covariance Structure 
Estimation Method 
Residual Variance Method 
Fixed Effects SE Method 
Degrees of Freedom Method 


Information 

WORK.EXAMPLE 

y 

Variance Components 
REML 
Profile 
Model-Based 
Satterthwaite 


Class Level Information 


Class Levels Values 


rep 

block 

A 

B 

C 


2 12 

4 12 3 4 

2 0 1 

2 0 1 

2 0 1 

Dimensions 


Covariance Parameters 2 
Columns in X 29 
Columns in Z 4 
Subjects 1 
Max Obs Per Subject 16 
Observations Used 16 
Observations Not Used 0 
Total Observations 16 


Iteration History- 


Iteration 


Evaluations 


-2 Res Log Like Criterion 


0 

1 


1 19.41205069 

1 19.24234227 0.00000000 
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Table 8.11 (Continued) 



Convergence Criteria met. 





Covariance 

Parameter 





Estimates 





Cov Parm 

Estimate 





block(rep) 

0.06250 





Residual 

0.3125 





Fit Statistics 





-2 Res Log Likelihood 


19.2 



AIC (smaller 

is better) 


23.2 



AICC (smaller is better) 


26.2 



BIC (smaller 

is better) 


22.0 



Type 3 Tests 

of Fixed Effects 



Effect 

Num DF 

Den DF 

F Value 

Pr > F 

rep 

1 

1 

693 

44 

0.0242 

A 

1 

6 

57 

80 

0.0003 

B 

1 

6 

561 

80 

< . 0001 

A*B 

1 

6 

0 

20 

0.6704 

C 

1 

6 

217 

80 

< . 0001 

A*C 

1 

6 

0 

20 

0.6704 

B*C 

1 

6 

125 

00 

< . 0001 

A*B*C 

1 

1 

13 

44 

0.1695 



Estimates 





Standard 




Label 

Estimate Error DF 

t 

Value 

Pr > |t| 

A 

2.1250 0. 

.2795 6 


7.60 

0.0003 

A*B 

0.1250 0. 

,2795 6 


0.45 

0.6704 

A*B*C 

-1.3750 0. 

,3750 1 


-3.67 

0.1695 


follows (using the notation from the previous section): E a£ (£ = 1,2,, 2 n _ p — 
1) from interblock information only and with variance l/r2 w_2 (crj + 2 p cr^), 
E Ym [m = 1,2,..., 2 n ~ p (2 p — 1)] from intrablock information only and with 
variance (l/r2 n ~ 2 ) aj. The fact that either intrablock or interblock informa¬ 
tion only is available for a particular interaction is a major difference between 
the block arrangements we have been discussing in this chapter and the block 
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Table 8.12 Auxiliary Analysis of 2 3 Factorial With Blocks of Size 4 and ABC 
Confounded with Blocks 

options nodate pageno=l ； 
proc glm data=example ； 
class rep block A B C ； 
model y=rep|A|B|C ； 
titlel 'TABLE 8.12'; 

title2 'AUXILIARY ANALYSIS OF 2**3 FACTORIAL 1 ; 

title3 'WITH BLOCKS OF SIZE4 AND ABC CONFOUNDED WITH BLOCKS'; 

run ； 

The GLM Procedure 

Class Level Information 

Class Levels Values 
rep 2 12 

block 4 1234 

A 2 0 1 

B 2 0 1 

C 2 0 1 

Number of observations 16 

Dependent Variable : y 

Sum of 


Source 

DF 

Squares 

Mean Square 

F Value 

Pr : 

> F 

Model 

15 

700.9375000 

46.7291667 





Error 0 0.0000000 

Corrected Total 15 700.9375000 


R-Square Coeff Var Root MSE y Mean 


1.000000 . . 23.06250 





310 


CONFOUNDING IN 2 n FACTORIAL DESIGNS 


Table 8.12 (Continued) 


Source 

DF 

Type 工 SS 

Mean Square 

F Value 

Pr > F 

rep 

1 

390.0625000 

390.0625000 



A 

1 

18.0625000 

18.0625000 



rep*A 

1 

0.0625000 

0.0625000 



B 

1 

175.5625000 

175.5625000 



rep*B 

1 

1.5625000 

1.5625000 



A*B 

1 

0.0625000 

0.0625000 



rep*A*B 

1 

0.0625000 

0.0625000 



C 

1 

68.0625000 

68.0625000 



rep*C 

1 

0.0625000 

0.0625000 



A*C 

1 

0.0625000 

0.0625000 



rep*A*C 

1 

0.0625000 

0.0625000 



B*C 

1 

39.0625000 

39.0625000 



rep*B*C 

1 

0.0625000 

0.0625000 



A*B*C 

1 

7.5625000 

1 • 5625000 



rep*A*B*C 

1 

0.5625000 

0.5625000 




Source 

DF 

Type III SS 

Mean Square 

F Value 

Pr > F 

rep 

1 

390.0625000 

390.0625000 



A 

1 

18.0625000 

18.0625000 



rep*A 

1 

0.0625000 

0.0625000 



B 

1 

175.5625000 

175.5625000 



rep*B 

1 

1.5625000 

1.5625000 



A*B 

1 

0.0625000 

0.0625000 



rep*A*B 

1 

0.0625000 

0.0625000 



C 

1 

68.0625000 

68.0625000 



rep*C 

1 

0.0625000 

0.0625000 



A*C 

1 

0.0625000 

0.0625000 



rep*A*C 

1 

0.0625000 

0.0625000 



B*C 

1 

39.0625000 

39.0625000 



rep*B*C 

1 

0.0625000 

0.0625000 



A*B*C 

1 

1 • 5625000 

1 • 5625000 



rep*A*B*C 

1 

0.5625000 

0.5625000 




arrangements we have discussed in connection with incomplete block designs in 
Chapters 1 ， 2, and 4. 

Although the choice of a particular system of confounding is no longer as 
crucial as it is in the case of fixed block effects, it is important nevertheless 
as is evident from the statement about the variances of the different estimators, 
interblock estimators having generally larger variance than the intrablock esti¬ 
mators. This is emphasized also by the way in which hypotheses about the E ai 
can be tested in the ANOVA. To show this we give in Table 8.9 the appropriate 
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五 (MS) in the ANOVA table pertaining to the interblock analysis. The hypothesis 
Hq ： E at = 0 is then tested using an approximate F test by 

F _ SS(E a ^) 

MS (Remainder) 

with (r — — 1) d.f. in the denominator, as compared to (r — l)2 n ~ p (2 p — 

1) for testing Hq: E ym = 0 in the intrablock analysis. 


8.8 NUMERICAL EXAMPLE USING SAS 


The following example illustrates how SAS PROC GLM and PROC MIXED can 
be used to analyze the data from systems of confounding. 

Consider again the 2 3 factorial in blocks of size 4 and with ABC confounded 
with blocks (see Example 8.2). The data are given in Table 8.10 together with 
the intrablock analysis, using PROC GLM. 

The combined analysis, using intrablock information for A, B, AB, C, AC, 
BC and interblock information for ABC, is obtained with PROC MIXED and 
is given in Table 8.11. 

We note here that in order to obtain the correct interblock information; that 
is, test of ABC, it is not sufficient to declare the block effects as random effects, 
but also amend the model statement by the option DDFM = SATTERTH (Note: 
It is always useful to check the d.f. of each F test as at least a partial assurance 
that the correct test has been performed). A check that the correct test for ABC 
has been performed is provided in Table 8.12 

As explained in Section 8.7 (see also Table 8.8b), the denominator for testing 
Hq ： ABC = 0 is given by MS (Remainder) [see (8.25)], which algebraically is 
given by MS (Rep x ABC) with r — 1 = 1 d.f. From Table 8.12 we obtain 

SS(ABC) = 7.5625 and SS(rep x ABC) = 0.5625 


and hence 


7.5625 

F = - 

0.5625 


=13.38 


which is approximately the same as the F value given in Table 8.11，namely 
13.44. " 。 " 

As a final note we mention that estimates for cr 》and are given in Table 
8.11 as 

a; = 0.3125 and aj, = 0.0625 
from which we can also reconstruct MS (Remainder) as 

MS (Remainder) = + = 0.5625 


which agrees with the value given in Table 8.12. 



CHAPTER 9 


Partial Confounding in 2 n 
Factorial Designs 


9.1 INTRODUCTION 

In the previous chapter we have seen how the treatment combinations of a 2 n 
experiment can be accommodated in blocks of size 2 P ， where pen ，and what 
consequences this has with regard to the estimation of effects and interactions. 
The price we have to pay for being able to reduce the error variance is the loss 
of information on certain interactions and possibly main effects. As long as only 
high-order interactions are confounded, the price may not be too high, but as 
is evident from Table 8.3, this may not always be possible; that is, in certain 
cases we may have to confound low-order interactions and/or main effects. This 
is clearly undesirable. We would like to obtain at least partial information on all 
essential effects and interactions. Since most experiments are replicated, it seems 
quite reasonable to use different systems of confounding in different replicates, 
which may achieve the objective just stated. This is known as partial confounding 
(as compared to the complete confounding of Chapter 8). 


9.2 SIMPLE CASE OF PARTIAL CONFOUNDING 
9.2.1 Basic Plan 

Consider the simplest possible factorial scheme, that involving two factors A, 
B ，each at two levels, and suppose that it is necessary to use blocks of two 
experimental units. The necessity of blocks of this size might arise, for example, 
in an experiment on young cattle, because it is possible to obtain a number of 
identical twins, that is, twins of the same genetic constitution and each pair of 
twins forming a block. Or it may be that the experimenter can handle only two 
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Replicate I Replicate II Replicate III 


1 

Block 

a 

ab 

1 

Block 

b 

ab 

1 

Block 

(1) 

ab 







2 

(1) 

b 

2 

(1) 

a 

2 

a 

b 


Figure 9.1 Partial confounding of a 2 2 experiment. 


experimental units at a time and that, if units were treated in random order, the 
experimental error introduced by randomizing over “times” would be consider¬ 
able. Another possible use of blocks of two EUs could arise in plant pathological 
work, the block being a leaf and the EUs the two halves. Other examples in other 
fields of experimental research (e.g., medicine, engineering, psychology) can be 
found easily. 

Given such a situation, suppose also that the experimenter wishes to obtain 
equal information on both main effects and the 2-factor interaction. This implies 
immediately that A, B, and AB should be confounded equally often with blocks. 
The basic pattern of the design would then consist of three replicates arranged 
as in Figure 9.1. The main effect A is confounded in replicate I, B in replicate 
II, and the interaction AB in replicate III. Each effect and interaction is then 
“partially confounded” with blocks. In view of this, we shall estimate the effects 
and interaction from the replicates in which they are unconfounded with blocks, 
namely 


Effect A estimated from replicates II, III 

Effect B estimated from replicates I, III 

Effect AB estimated from replicates I, II. 

These estimates will be subject to an error based on the variance of units within 
blocks of two units treated alike, and, in order to obtain a reasonably precise 
estimate of this error variance, we shall need several repetitions of the basic 
pattern given in Figure 9.1. Suppose we have q repetitions (i.e., 3q replicates in 
all); then the ANOVA will have the structure shown in Table 9.1. 


9.2.2 Analysis 

The ANOVA is based on the three-part model 

yi jki = /x + p/ + Pij + Tkt + eijkt (9.1) 

where pi is the effect of the ith replicate (i = 1,2,..., 3q), is the effect of 
the jth block (j = 1, 2) in the /th replicate, and xu is the effect of the treatment 
combination (k, 1 = 0, 1). The reader will recognize that the basic pattern of 
this design, given in Figure 9.1，is actually a BIB design (see Chapter 2). Hence 
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Table 9.1 Structure of T|B-ANOVA for Partially 
Confounded Design 


Source 

d.f. 

SS 

x p \o 

3q - 1 

- 歹 … +) 2 

X^\3, X p 

3^ 

2 1>,>- 歹 ,:.) 2 

ij 

X z \3, X p ,X^* 

3 


A 

1 

2^ [An ni] 2 

B 

1 

[^i, hi] 2 

AB 

1 

2q [ABn, nf 

I\0, x p , Z T 

3(2^ - 1) 

Difference 

Total 

I2q - 1 

工 /jijkt - 歹 … .) 2 

ijkl 


the complete design is a resolved BIB design and could be analyzed as such (see 
Section 9.3). It is, however, much simpler to use the factorial structure of the 
treatments and the particular system of confounding that has been employed. 

As mentioned earlier, the effects and the interaction can be estimated (unbi- 
asedly) only from those types of replicates^ in which they are not confounded. 
To emphasize this point, we write An,m, Bi m, AB\ \\ indicating that A is esti¬ 
mated from replicates of types II and III, that is, from r = 2q replicates; B is 
estimated from replicates of types I and III; and AB is estimated from replicates 
of types I and II. Consequently, the associated sums of squares are based on 
these estimates, as given in Table 9.1. Also, using the general formula (7.62), 
with r = 2q, n = 2, we have 


var(A[ U ii) = var(5 un ) = var(A5 U i) = —a] 

2q 

Following the procedure outlined in Section 8.5 we can partition SS(Blocks/ 
replicates) = |3, X p ) and SS(Residual)=SS(/|3, X p , X T ) of 

Table 9.1 as given in Table 9.2. The SS (Blocks/replicates) is further partitioned 
into a treatment component, associated with the estimates of A, B, AB from those 
types of replicates in which they are confounded, and SS (Remainder), which is 
algebraically equal to the interactions between the effects and the replicates in 
which the effects are confounded. The actual computation of the various sums 
of squares is indicated explicitly in Table 9.2a. Here, for example, A\ is the esti¬ 
mate of A obtained from observations in the q replicates of type I in which A 
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Table 9.2 Partitioning of Sums of Squares 


Source 

d.f. 

SS 


a. SS(Blocks/Replicates ) 


A 

1 

心 ] 2 

B 

1 

q [fin] 2 

AB 

1 

q [^Siii]" 

A x repSj 

a - 1 

i=l 

B x reps n 

Remainder q — 1 

[瓦 ii] 2 - 《 [5u] 2 

t=\ 

AB x reps m 

a - 1 

q 〜 〜 

[A5mi] — q [ABui] 2 

i=\ 


b. SS(Residual) 


Ax reps n m 

2q - 1 

ii, hi] 2 — 2^[An, hi] 2 

B x repsj ni 

2q - 1 

I, III] 2 — 2q[Bi III ] 2 

AB x reps! n 

2q - 1 

2 分 

i, n ] 2 -2 ? [A5!, n] 2 

m=l 


is confounded. Similarly, Aa is the estimate of A obtained from observations in 
the Ith replicate of type I. 

With regard to SS(Residual) it can be partitioned into three components each 
one algebraically equal to the interaction between an effect and replicates, in 
this case the replicates in which the corresponding effect is not confounded. 
The form of the sums of squares is given in Table 9.2b. The notation should be 
obvious now. 

9.2.3 Use of Intra- and Interblock Information 

In concluding the discussion of this introductory example of partial confounding, 
we point out that in this case we have both intra- and interblock information about 
A ， B ，and AB. If the block effects in model (9.1) were considered to be random 
effects, we could obtain combined estimates of the effects and interaction. We 
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know from our previous discussion that 

var(Ai Un ) = var(fi un ) = var(AB U i )= 七 cr 》 

and, as follows from Section 8.6, 

var(Ai) = var(Sn) = var(ABm)= 吾 (〜 2 + 

Hence the combined estimate of A, for example, is the weighted average of 
An, m and Aj, that is, letting w = l/o^ 2 and w r = l/(o^ + 2aj), 

2 = 2 — 石 1 ， m + gw^i (9 2) 

2qw + qw f 

with similar expressions for B and AB. (For a proof of this result we refer to 
Section 11.6.) Since w and w f or p = w/w f are usually not known, the quantities 
have to be estimated and then substituted into (9.2). We know that 

df = MS (Residual) 


and we also know that (see Section 8.6) 

a]- + 1(7、= MS (Remainder) 

There is, however, another way of estimating + 2<r,. This is accomplished 
through the B|T-ANOVA as given in Table 9.3 by utilizing all 3q d.f. for 
SS(X^*\0, Xp, X r ) and not just the 3(q — 1) d.f. for SS(Remainder). We com¬ 
ment briefly on how SS(Z^*|3, Z T ) is obtained in a way other than the 
usual as indicated in Section 1.3. 

As mentioned above, SS (Remainder) from the T|B-ANOVA is part of 
SS(Xp*\0, X p , X r ) because it is free of treatment effects. Since this SS accounts 
for 3(q — 1) d.f., there remain 3 d.f. to be accounted for. These are obtained by 
realizing that for each effect and interaction we have two estimates, namely one 
from replicates in which the effect is not confounded an 生 one from^eplicates in 
which the effect is confounded. Specifically, we have An,m and Bijn and 
Bn, and AB\ \\ and Obviously, the comparison of these two types of esti¬ 

mates is a function of block effects and error only, and hence the associated sum 
of squares belongs to SS(Xp*\0, X p , X r ). Since, for example, An,m is obtained 
from 2q replicates and A\ from q replicates and these estimates are uncorrelated, 
we have 


SS(Anjn — Ai)= 


[An ,hi - Ai] 2 
l/2q + l/q 


=f [An,ni - A ：] 2 


(9.3) 



SIMPLE CASE OF PARTIAL CONFOUNDING 


317 


Table 9.3 Structure of B|T-ANOVA for Partially Confounded Design 


Source 

d.f. 

SS 

E(MS) 

x P \o 

3q - 1 

From Table 9.1 


X T \0,X P 

3 





U 


X^*\0, X p , X T 


|(An,m-A t ) 2 

+ 

An ,hi vs. A\ 

1 

Bi hi vs. Bn 

1 

-j- (Si,m — 5 ii) 2 

a e + 

ABm vs. Am 

1 

y (Afii.il - Afim) 2 

+ 

Remainder 

Xq - 1) 

From Table 9.2a 

a e + 

Residual 

H2q - 1) 

From Table 9.2b 

〜 2 

Total 

I2q - 1 

From Table 9.1 



Similarly 

SS(Bi,iii — Bn) = — Bu ] 2 

(9.4) 


SS(ABi u — ABm) = 警 [A 召 i，ii - A 召 m ] 2 

(9.5) 

Since the comparisons Amu — A\, Bim — Bm, and AB\ \\ — AB\\\ are orthog¬ 
onal to each other, the SS (9.3) ，（ 9.4)，and (9.5) are orthogonal. Also, the 3 d.f. 


associated with (9.3), (9.4)，and (9.5) are not accounted for in SS(Remainder) 
since the comparisons leading to SS (Remainder) involve only comparisons among 
blocks from those replicates in which the respective effects are confounded, as 
is evident from Table 9.2a. 

Returning now to the estimation of + 2a^, to be used in (9.2), we evaluate 
as usual X p , X r )] assuming that the 的 】 are i 」 .d. random variables 

with mean zero and variance a,. We do this by obtaining the expected value 
of each component of SS(Z^*|3, Z T ) as given in Table 9.3. We already 
know that 

f'tSSCRemainder)] = 3(q — 1)(〜 2 + 2a^) (9.6) 

Now 

£[SS(A IU11 - Ai)] - ^var(A lun - (9.7) 
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since 


and 


var(A u ,in)=-< 


var(Ai) = ~i a e 2°^) 

The same result (9.7) is, of course, obtained also for SS(Bi m — Bn) and 
SS(AB\ n — Am). Hence, 


EmS(X^\3,X p ,X r )] 




+ 


+ 3(g — l)(cr^ + 2(7 》、 


.2 , - 1 

& 


An estimator for + 2a^ is then 

af+2aj = MS(X r |3, ^r) - MS(Residual)] 


(9.8) 


(9.9) 


We note here that if q is large, MS (Remainder) may actually be a quite satisfac¬ 
tory estimator for + 2g 、. 


9.3 PARTIAL CONFOUNDING AS AN INCOMPLETE 
BLOCK DESIGN 

In the previous section we have presented a simple example of partial con¬ 
founding together with the analysis based entirely on reasoning suggested by the 
factorial structure of the treatments and the corresponding allocation of the treat¬ 
ments to blocks. The design given in Figure 9.1, replicated q times, is, of course, 
an example of an incomplete block design, in fact a resolvable BIB design in this 
case as we pointed out earlier. As such, data from this design can be analyzed 
according to the general principles discussed in Chapter 1, or, more specifically, 
as in Chapter 2. We shall do this now and show that the resulting analysis agrees 
with that given in the previous section. 

9.3.1 Two Models 

Without loss of generality and for purpose of ease of notation only, we assume 
that the arrangement of treatments in blocks is the same for each repetition of 
the basic design of Figure 9.1. Using the model for an incomplete block design, 
we have 


y = jxO + + X t t + e 


(9.10) 
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with y = (yijktm), i = 1,2,... ,q indicating the repetition, y = 1, 2, 3 indicating 
the replication within repetition (as labeled in Fig. 9.1 )，k = 1,2 denoting the 
block within replication, and €，m = 0, 1 denoting the levels of factors A and 
B. The observation vector y consists then of q segments Cyniio, 1111 , ； VU200, 
: y/1201 ， M2101 ， ^2111, yi2200^ : y/2210, ^3100. ^311 b )^3210, 7/3201)’ 0 = 1,2,..., q) 
following Figure 9.1. Further, it can be deduced easily that 


X ■ 色 = 16q 乂 



(9.11) 


,0 10 0、 
0 0 0 1 
10 0 0 
0 0 10 
0 0 10 


X T = 3^ x 


1 

0 


0 10 0 


10 0 0 


0 0 0 1 


0 10 0 

\0 0 1 0 / 


and t’ = (too, no, toi, tu). 

Alternatively, we may write (9.1) as 


j = /x3 + + X T *r^e (9.12) 


where 




0 q X 


/I 

1 

-1 

-1\ 

1 

1 

1 

1 

1 

-1 

-1 

1 

1 

-1 

1 

-1 

1 

-1 

1 

-1 

1 

1 

1 

1 

1 

-1 

-1 

1 

1 

1 

-1 

-1 

1 

-1 

-1 

1 

1 

1 

1 

1 

1 

1 

-1 

-1 

\1 

-1 

1 

-1/ 


(9.13) 
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and r*' = (M, 士 A, ^B，^ABy The “factorial incidence matrix” Np 
is then of the form 


or 


N F = ^ q X 


Nf = 23^ x 


/2 2 2 

2-2 0 
0 0 2 

\0 0 0 


2 2 2 \ 

0 0 0 

-2 0 0 

0 2 - 2 ) 


/l 11 11 l\ 

1-10 00 0 

0 01-10 0 

\0 0 0 0 1 - 1 / 


9.3.2 Normal Equations 

We consider now the RNE for r*, that is, 

(X^Xr* - NfiX^X^N^r* = - N f^X X^y 

or 

(l2ql 4 - jNpN'p) t* = T* - \N f B 

where 



/ !2qM\ 


Will、 

T* = 

6qA 


召 112 

, B = 



6qB 




^ 6qABj 


\ B q^J 


are the effect totals and block totals, respectively. With 


and 


/6 0 

N F N' F =4q ° I 
v 0 0 


0 0\ 
0 0 
2 0 
0 2 / 


N F B 二 1 


/B... 

\ 

B.n - 

- ^ 12 

召 21 - 

- B.22 


(9.14) 

(9.15) 

(9.16) 

(9.17) 



PARTIAL CONFOUNDING AS AN INCOMPLETE BLOCK DESIGN 


321 


Eq. (9.16) reduces to 



/Sq - \A \ 


^6qA — (B.\i — 5.12 ) 、 


( 4^A n? in^ 


Sq • 


6qB — (5.21 — B. 22 ) 


4q Bi，ui 


j 


\6qAB — (5.31 — 5 . 32 ) j 


\4qABinj 

and hence to 




/A \ 
㊉ 


(A lU u\ 


since B.\\ — B .\2 = 2^Ai, B ， 2 \ — 2^5^ 5.31 —^32 = 2qABm, A = 

|(义1 + 2An,m), B = 士 (％ + 2Bim), AB = ^(ABm + 2ABm). 


Turning to the RNE for jS, we consider 
(X'pXfi — N' F (X' x ,X r *r l N F )fi = N'piX^X^X'^y (9.18) 


or 


21 




12^ 


N f F N F )P = B 


12^" 


A^r* 


with B and r* as defined in (9.17). With 


N f F N F = 4(V;) x 


■2 o i i i n 
0 2 1111 
112 0 11 
110 2 11 
11112 0 
■ 11110 2」 


= H0 q 0 f q ) x H say 

we see that the coefficient matrix for P ，that is, 




(6^/ - 0 q 3' q 


x H] 


(9.19) 


(9.20) 


is of rank 6q — 
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9.3.3 Block Contrasts 

From the form of (9.20) we can write out 6q — l identifiable functions of 
namely 3^ — 1 of the form 

Si: { 2(^71 + ~ Pq3\- Pqyi ).， i = 1,2,... j = l, 2 , 3 ; (ij) ^ (^ 3 )} 

3(q — 1) of the form 


Si' {2(^y 1 — Pij2 — Pqj\ + Pqjl ).， / = 1, 2,..., ^ — 1; y = 1, 2, 3} 
and 3 of the form 


s3 ： \-J2(Piji-Pij2yj = h2,3\ 


These three sets of identifiable functions are orthogonal to each other in the sense 
that any function in S v is orthogonal to any function in S v ^(v, v' = 1, 2, 3; y v f ). 
The corresponding RHSs are obtained by writing the RHS of (9.19) in the form 


2M +A 
2M -A 


B-3 q x 


2M + S 
2M-B 
2M-\-AB 
2M-AB 


We then find the following RHSs for 

5i ： {Bij. - / = 1 , 2...,^；7 = 1,2,3; (ij)^ Oq)\ 

$ 2 : {Bijl - Bij2 - B q j\ + Bqj2; / = 1， 2 …， g - 1; = 1 ， 2, 3} 

S 31 { B.\\ — B .\2 — 2qA, B ， 2 \ — B .22 ~ 2qB ， B.y[ — B.yz — 2qAB^ 

Within the context of the factorial calculus we can rewrite these for 52 and ^3 as 
Sr. {2(1, - A q ) (for j = 1 ) 

2(5, - B q ) (for j = 2) 

2(ABi - AB q ) (for ； = 3);; = 1, 2..., ^ - 1} 

{ Aq 〜〜 40 〜〜 4^7 ■^^^ 

~(^i — An,in )，~(^ii — 召 i,in) ， —(A^ni — A 召 i ， ii) 

since B/ii — _S/i 2 = 2An, B^i — = 2Bm, B^i — = 2ABmi. 
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We recognize that the comparisons S\ represent comparisons among replicates 
and the associated sum of squares is SS(Replicate) in Table 9.1. The comparisons 
of 52 and S 3 are comparisons of block effects within replicates and hence belong 
to X p , X z ). The sum of squares associated with 5^ is SS (Remainder) of 

Table 9.2, each set for 7 = 1,2, 3 leading to a sum of squares with q — I d.f. 
Finally, the three functions of S 3 give rise to the remaining three single d.f. sums 
of squares belonging to X p , X T ) as given in Table 9.3. 

We have thus shown that a system of partial confounding can be analyzed 
using the general principles of incomplete block designs. Although we have only 
considered the intrablock analysis of a particular case, this line of argument can, 
of course, be carried over to the combined analysis and to the general case. It is 
hoped, however, that the reader realizes that the analysis can be described and per¬ 
formed much easier via the concepts of factorial experiments as the factorial struc¬ 
ture of the treatments determines the structure of the incomplete block design. 


9.4 EFFICIENCY OF PARTIAL CONFOUNDING 

In general, reduction of block size will lead to a reduction of experimental error 
variance. On the other hand, this forces the experimenter to use a system of 
partial confounding, which may offset this gain. Therefore, if the experimenter 
has a choice of using blocks either of size 4 or of size 2, he or she needs a 
criterion on which to base the choice. For the simple case discussed so far, this 
is obtained by comparing the information provided by both designs (using the 
same number of replicates) as given in Table 9.4. 

The information of the partially confounded scheme relative to that of the 
scheme with no confounding is 

昏_ ( 9 . 21 ) 

where the subscript on cr 2 denotes the number of units per block. If is greater 
than 3 / 2(72 (or less than 2/3cr^), the information is greater with the partially 
confounded design. Equivalently, we say then that the efficiency of the partially 
confounded design relative to the unconfounded design, as given by (9.21), is 


Table 9.4 Information on Effects with Equal 
Confounding 



No Confounding 

Partial Confounding 
of Section 9.2 

A 

B 



< 


AB 
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Table 9.5 Information on Effects with Unequal 
Confounding 



No Confounding 

Partial Confounding 

A 

1 


B 

a A \ 



4 

2 q 

AB 




larger than 1. In general, will be greater than but whether it will be suffi¬ 
ciently greater to give the advantage to the partially confounded design depends 
on the experimental material. In many cases this may prove to be a difficult 
question. 

We have considered a scheme of partial confounding that results in equal 
information on main effects and the 2-factor interaction. In some cases it may 
be more appropriate to obtain greater information on main effects. This would 
entail greater representation of replicates of type III, to an extent depending on 
the relative amounts of information required. Suppose that a basic repetition 
consists of one replicate each of types I and II and 2 replicates of type III and 
that there are q such repetitions, that is, 4q replicates all together. The amounts 
of information to be compared then are as given in Table 9.5. 

Since the emphasis is on main effects, it seems reasonable to use the partial 
confounding scheme if 

The design utilizing partial confounding that we have just mentioned is obvi¬ 
ously only one of many choices. The number of choices becomes even larger 
as the number of factors increases. Some such cases will be discussed in the 
following sections. 


9.5 PARTIAL CONFOUNDING IN A 2 3 EXPERIMENT 

As we have mentioned earlier, the arrangements utilizing partial confounding 
that are best for any given situation depend on the information the experimenter 
wishes to obtain. Suppose, for example, with an experiment of three factors 
A, B, C, each at two levels, the experimenter desires maximum possible accuracy 
on main effects and equal information on the 2-factor and 3-factor interactions. 

9.5.1 Blocks of Size 2 

A suitable system of partial confounding will consist of a number of repetitions, 
q say, of the following types of replicates: 


Type I: Confound AB, AC, BC 
Type II: Same as type I 
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Type III: Confound A, BC, ABC 
Type IV: Confound B, AC, ABC 
Type V: Confound C,AB,ABC 

This design requires 5q replicates, that is, 20q blocks, which may not always be 
feasible. The information that this design yields as compared to the unconfounded 
design is given in Table 9.6. 

For an example using SAS PROC FACTEX see Table 9.14, Section 9.10.1. 


9.5.2 Blocks of Size 4 

In this case the experimenter will use a number of repetitions of the following 
basic pattern (with each column representing a block): 


I II 


(1) a 


(1) b 

ab b 


ac c 

c ac 


b ab 

abc be 


abc be 


III IV 


(1) b 


(1) a 

be c 


ab b 

a ab 


ac c 

abc ac 


be abc 


confounding AB, AC, BC, ABC in replicates of types I, II, III, IV, respectively. 

Suppose we use q repetitions, the positions of replicates, and blocks within 
replicates and treatment combinations within blocks being randomized. The infor¬ 
mation from this design and the corresponding unconfounded design is given in 
Table 9.7. It follows then that the partially confounded design will yield more 
information on interactions than the unconfounded design and substantially more 
information on main effects if is less than . 

Table 9.6 Information Given by Design for 2 3 System in Blocks of Size 2 

No Confounding Partial Confounding of Section 9.5.1 


Estimate from 



Information 

Estimate From 

Information 

Replicate of Types 

A 




Sq/cr^ 

I, II, IV, V 

B 




Sq/cr^ 

I, II, III, V 

AB 


lOq 



III, IV 

C 


All replicates 

%qlo^ 

I, II, III, IV 

AC 




III, V 

BC 





IV, V 

ABC 





I, II 
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Table 9.7 Information Given by Design for 2 3 System in Blocks of 4 



No Confounding 

Partial Confounding of Section 9.5.2 





Estimate from 


Information 

Estimate From 

Information 

Replicate of Types 

A 



8g/a 4 2 

All 

B 




All 

AB 



6q/a 4 2 

II ， III ， V 

C 

Sq 

3 

All replicates 


All 

AC 


6q/a 4 2 

I ， III ， V 

BC 



6q/a 4 2 

I ， II ， V 

ABC 



6g/a 4 2 

I ， II ， III 


Table 9.8 Structure of T|B-ANOYA 


Source of Variation 


d.f. 

Replicates 

4^ 

- 1 

B locks/replicates 



A 


1 

B 


1 

AB 


1 

C 


1 

AC 


1 

BC 


1 

ABC 


1 

Residual 

2Aq 

-7 

Total 

32q 

- 1 

B locks/replicates 


4q 

AB 


1 

AC 


1 

BC 


1 

ABC 


1 

AB x repsj 

q 

一 1 

AC x reps n 

q 

- 1 

BC x reps m 

q 

-1 

ABC x repSjy 

q 

- 1 
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Table 9.8 (Continued) 



Source of Variation 

d.f. 

c. 

Residual 

24q-l 


A x reps (all) 

4q - 1 


B x reps (all) 

4q - 1 


AB x reps (II, III, IV) 

3q - 1 


C x reps (all) 

4q - 1 


AC x reps (I, III, IV) 

3q - 1 


BC x reps (I, II, IV) 

3q - 1 


ABC x reps (I, II, III) 

3q - 1 


The partition of the degrees of freedom in the analysis of variance is given in 
Table 9.8a with a breakdown of the degrees of freedom for blocks in Table 9.8b 
and residual in Table 9.8c. 

The computation of the various sums of squares follows the general procedure 
as indicated in Tables 9.1 and 9.2 using the observations from the respective 
replicates according to Tables 9.7 and 9.8. 


9.6 PARTIAL CONFOUNDING IN A 2 4 EXPERIMENT 
9.6.1 Blocks of Size 2 

In this situation each replicate will consist of eight blocks. With s types of 
replicates, each utilizing a different system of confounding, there will be 8s — 1 
d.f. for blocks, 15 d.f. for treatments if no effect is completely confounded, and 
therefore Ss — 15 d.f. for residual. In order to have an error based on a reasonable 
number of d.f” it seems that we need at least 5 = 4 replicates. It follows from 
Table 8.3 that under these circumstances the best system of confounding appears 
to be as follows: 


Replicate Effects and Interactions Confounded 


I 

A, 

BC ， 

ABC, 

BD, 

ABD, 

CD, 

ACD 

II 

B, 

AC, 

ABC, 

AD, 

ABD, 

CD, 

BCD 

III 

c, 

AB, 

ABC, 

AD, 

ACD, 

BD ， 

BCD 

IV 

D, 

AB, 

ABD, 

AC, 

ACD, 

BC, 

BCD 


These four replicates give | relative information on main effects (in the sense that 
all main effects are estimated from three out of four replicates), ^ relative infor¬ 
mation on 2-factor interactions, ^ relative information on 3-factor interactions, 
and full information on the 4-factor interaction. 
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Since the number of observations is already quite large for one set of repli¬ 
cates, it does not seem to be practical to have repetitions of the basic pattern 
given above. 

9.6.2 Blocks of Size 4 

Referring again to Table 8.3, we find that a reasonable pattern of confounding is 
obtained by combining two systems of confounding as follows: 


Replicate Interactions Confounded 


I 

AB, 

CD ， 

ABCD 

II 

AC, 

BD ， 

ABCD 

III 

AD, 

ABC, 

BCD 

IV 

BC, 

ABD ， 

ACD 


This yields full information on main effects, | relative information on 2 - and 
3-factor interactions, and \ relative information on the 4-factor interaction. It 
also provides a sufficient number of degrees of freedom for residual. 

For a design using replicates of types I and II see Section 9.10.2. 

Another pattern of confounding, yielding equal information on all effects and 
interactions, is based on the completely orthogonalized 4x4 square given in 
Table 9.9. If we insert into each cell of this square the treatment combinations 
obtained by multiplying the treatment combinations listed for the correspond¬ 
ing row and column and then make up blocks of size 4 according to columns, 
rows, Latin letters, Greek letters, and numerals, we find the following pattern of 
confounding: 


Replicate 

Blocks 

Obtained From 

Effects and Interactions 
Confounded 

I 

Columns 

A,B,AB 

II 

Rows 

C,D，CD 

III 

Latin letters 

AC, BD,ABCD 

IV 

Greek letters 

AD, ABC, BCD 

V 

Numerals 

BC,ABD,ACD 


The reader will realize that this design is in fact a resolvable BIB design with 
parameters t = 16, b = 20, r = 5, k = 4, X = 1. 


9.6.3 Blocks of Size 8 

Most likely one would use here a system of complete confounding, namely con¬ 
founding ABCD. If some information is wanted on all interactions, a reasonable 
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( 1 ) 

a 

b 

ab 

( 1 ) 

Aal 

Bp2 

Cy3 

D84 

c 

By4 

A83 

Dal 

C/31 

d 

C82 

Dyl 


Ba3 

cd 

DP3 

Ca4 

BS1 

Ay2 


system of confounding would be to confound some or all of ABC, ABD, ACD, 
BCD, and ABCD, one in each replicate. 

For all the examples discussed in this section it should be clear what the 
structure of the analysis is. Also, these examples should serve as an illustration 
for more general systems of partial confounding for 2 n experiments. 


9.7 GENERAL CASE 

The discussion in the preceding sections should have given an indication of how 
the concept of partial confounding can be utilized very effectively for obtaining 
information about most or all effects and interactions using blocks of small size. 
It also has pointed out that there exist many different designs, that is, patterns 
of confounding, for a given situation. There does not seem to be a reasonable 
way of discussing the various designs in general. A careful choice must be 
made for each specific experiment based on the objectives of the experiment and 
prior knowledge or expectations about the experimental situation. All designs, 
however, have certain properties in common, and we shall comment on these as 
they affect various parts of the analysis. 

Suppose we have a 2 n factorial in blocks of size 2 P . We then have 2 n ~ p 
blocks in each replicate; that is, 2 n ~ p — 1 interactions are confounded in each 
replicate. Let a basic pattern of partial confounding consist of 5 types of repli¬ 
cates and let there be q repetitions of the basic pattern. We can then divide 
the totality of the 2 n — \ interactions into three mutually exclusive sets 
as follows: 


£i = {E ak , k = 1,2,..., n \； completely confounded} 
£2 = i = 1,2,, ri 2 \ partially confounded} 

£3 = {E 8m , m = 1,2,..., ^ 3 ； not confounded} 


with n\ -\- U 2 = 2 n — l. With regard to £ 2 , let E y ^ be confounded in c(yi) 
replicates and not confounded in u{yi) = s — c(yn) replicates. Finally, let N = 
2 n sq denote the total number of observations. 
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9.7.1 Intrablock Information 

Each E 8m e £3 is estimated from all replicates, and 


var ㈣ = 

and hence 

SS (£ ‘) = sq2 n ~ 2 E Sm 


(9.22) 


E n e £2 is estimated from qu{yi) replicates with 


and 


var 


( 和） 


qu(n)2 l 




(9.23) 


SS (E y ^) = qu(n)2 n ~ 2 [E y ^f (9.24) 


9.7.2 The ANOVAs 

The basic partitioning of the total sum of squares and its d.f. in the T|B-ANOVA 
is given in Table 9.10a, with a further partitioning of the block and residual sum 
of squares indicated symbolically in Tables 9.10b and 9.10c. The ANOVA is 
based on model (9.1). 

In Table 9.10a SS(E y ^) and SS(E 8m ) for E yi g £2 and E 8m e £3 are obtained 
from (9.22) and (9.24), respectively, and vr is obtained by subtraction. In 
Table 9.10b SS(E ak ) c and SS (五 h) c for E ak e £1 and E y ^ e 8,2 are obtained 
from the replicates in which these interactions are confounded, that is, 


and 


Similarly 


and 


SS (E ak ) c = sq2 n ~ 2 [E ak f 
SS (E^) c = qc{ Yl 2 n - 2 ) [ 办] 2 


SS ㈣） = 2 n ~ 2 

r n2 

-哲： 


r 〜,， "I 2 

SS ㈣ ， 2"- 2 

E y t 

L v 」 


where (ij) denotes the replicate j in repetition i and Y^ij denotes summation 

over all replicates in which a given E y ^ is confounded, that is, qc(yi) replicates. 

Also, in Table 9.10c SS(E^f) and SS(Ef^) are obtained from those replicates 

ij 

in which the corresponding effects are not confounded, that is, 
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Table 9.10 T|B-ANOYA for Partial Confounding 


Source 

d.f. 

SS 



a. Basic Partitioning 


x P \o 

sq — l 

Usual 


x^\0, x p 

sq(2 n ~P - 1) 

Usual 


X z \3, X Pi Xp* 

ri2 + «3 

SS(E y n + J2 SS(E Sm ) 

l m 

I\0, X p , X^*,X T 

VR 

Difference 


Total 

N - 1 

Usual 



b. Partitioning of Block Sum of Squares 


x^\ 0 , x p 

sq(2 n ~P - 1) 



[E ak ] 

n\ 

J2 SS(E ak ) c 
k 


{E^} 

ni 

J2 SS(£ y 0c 

t 


[E ak x reps} 

n\{sq - 1) 

EE ss ( s # 

k ij 

-J2 SS(E a ” c 

k 

[E yi x reps} 

q ^2 ~ ni 

i 

E E ,ss (4 〜 

^ U 

-J2 ss ( 奶 ) c 

t 


c. Partitioning of Residual Sum of Squares 


I\0, x p , X^, X T 




[E yi x reps} 

q X! 一 《2 

t 

E E 〃 ss (4 e ) 

^ ij 

- J2 SS(£h) 

l 

[E 8m x reps} 

n 3 (sq - 1) 

EE SS ( 畸 ) - 

m ij 

- J2 SS(E s ” 

m 

and 

SS (O = 2' 卜 2 

2 

[ E ij \ 


and Y^ r denotes summation over all replicates (ij) in which a given E y ^ is not 
confounded, there being qu(yi) such replicates. 
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Table 9.11 B|T-ANOVA for Partial Confounding 


Source 

d.f. 

SS 


a. Basic Partitioning 

x P \o 

sq — \ 

Usual 

X T \0,X p 

2 n - 1 

Usual 

x^\o,x p ,x T 

sq(2 n ~P -l)-m 

Difference 

I\0,X p ,X T ,X^ 

VR 

From Table 9.10 

Total 

N - 1 

Usual 


b. Partitioning of Block Sum of Squares 

x^\o,x p ,x T 

sq{T~P -\)-n x 


如 vs . 私 

ni 

^2,c(yi)u{Y()[E y ^ E y ^] 

s i 

[E ak x reps} 

n\{sq - 1) 

From Table 9.10b 

[E y ^ x reps} 

q^c(Yi) - n 2 
t 

From Table 9.10b 


The B|T-ANOVA in its basic form is given in Table 9.11a and a partitioning 
of the block sum of squares is given in Table 9.11b. Here 


with 


SS (E^e vs. 扣 ) 


var (E ye — E rt ) 


var (Ere - EVi) ja} 


y qu(yt)2 n ~ 2 qc{yi)2 n ~ 2 y 


qu(n)c(ye)2 n - 


9.7.3 Interblock Information 

Assuming P* to be a vector of i.i.d. random variables with E(P^) = 0, 

= we can obtain interblock information on the E ak e and E y£ e S 2 , 
namely 

E (E ak ) = E ak with var (S ak ) = . (cr ； + 


and 


E = E Yl with var(£^ £ ) 


sq2 n ~ 2 

1 


qc(yi)2 n ~ 


2 S) 

e+ 2 p crfj (9.25) 
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9.7.4 Combined Intra- and Interblock Information 

Combined information is available for E yi e £2 - Using (9.23) and (9.25)，we 
obtain the combined estimator 

^ u(yi)wE yi + c(yi)w f E yi 

E" = —^ - - (9.26) 

u{yt)w + c{yt)w f 

where, as usual, w = = l/(o^ 2 + 2 p a^). Letting w f /w = p, we have 


var (Ey^\ = t - x -- 

、 ' [u(yt)w + c(yt)w r \ q2 n ~ 2 

^2 


+ c(yi)p]q2 n ~ 2 


(9.27) 


9.7.5 Estimation of Weights 

As always, we estimate and hence w from the T|B-ANOVA, and using the 
Yates procedure, we estimate and hence w/ from the B|T-ANOVA. From 
Table 9.10 we obtain 

SS(I,X p ,X^,X r ) 

^2 = p， = MS(/|3, Xfi*, X r ) (9.28) 

and hence w = 1/5^. 

In order to estimate w r we consider the expected values of the sums of squares 
that form SS(X^*\3, X p , X T ) as given in Table 9.11b. Under the mixed model 
assumption given in Section 9.7.3 above we find, for each E y ^ e £ 2 , 

E (E y i - E y if = var (£ 〜一 

一 1 

_ q2' l _ 2 

_ _^_[" 2 , u (n)2 p 2 

q2 n ~ 2 u{Yl)c(Yl) ° e s 

and hence, from (9.29) and Table 9.1 lb, 

ESS {E y t vs. E y i) = a ； + 

Since 

ESS (E ak x reps) = (sq - 1) (a; + 2 ， )oj) 
and 

ESS (E yi x reps) = [qc(yi) - l] + 2 p cxj 、 


(9.29) 

(9.30) 

(9.31) 

(9.32) 




c(n) 


+ ^ P °t) 
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we obtain, by using (9.30), (9.31), and (9.32) in Table 9.11b, 

EMS (Xp ， \0,X p ,X r )^^ + A2Pcrj 

where 

(1A) + ni(sq - l) + c(n) - n 2 

A = _^^_ 

sq(2 n ~P - 1) - « ] 

From (9.33), (9.34), and (9.28) we then obtain 

ol + 2 印 , =• = (1 - 去 ) MS(/|a,Z p , Xp*,Xr) 

+ ^MS(Xfi*\3,X p ,X x ) 


(9.33) 


(9.34) 


(9.35) 


An interesting consequence of our derivation of X p , X x ) is that 

because of (9.30), that is, the heterogeneous expected values of the components 
of SS(X^*\3, X p , X T ), it follows that even under the assumption of normal¬ 
ity SS(X^*\0, Xp, X T ) does not follow a scaled x 2 distribution but rather is 
distributed as a linear combination of scaled x 2 distributions. 

Of course, as we have discussed earlier (see Section 1.11), other estimation 
procedures can be used to obtain estimates of and a, (see Section 9.10 for 
numerical examples using such procedures). 


9.7.6 Efficiencies 

Under a system of no confounding with r = qs replicates per treatment com¬ 
bination (=number of blocks of size 2 n ) and a particular pattern of partial 
confounding, we can compare the following amounts of information: 

No Confounding Partial Confounding 

€ £] sq2 n ~ 2 /a^ 0 

e £2 sq2 n - 2 lal uiydql 11 - 1 /a 、 

E s '" e £3 sqT^lal sq2 n ~ 2 /al 

where denotes for blocks of size 2 ” and <r^ denotes for blocks of size 
2 P . From this we can compute the efficiency for each interaction and determine 
when the information on interactions in £2 and S 3 under confounding is at least 
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as good as that without confounding depending on the ratio / o^. Excepting 
interactions in £1 we can express the average efficiency as 


^uiyt) n^s 


j (ri2 + ri3)s 




9.8 DOUBLE CONFOUNDING 

In certain experimental situations it may be necessary to use more sophisticated 
systems of confounding than we have discussed up to this point. Consider, for 
example, a 2 5 experiment, where each treatment is applied to a certain material 
that is then processed by a machine. Homogeneous batches of material, sufficient 
for four runs (treatment combinations), are available and, in order to complete the 
experiment in a reasonable time, four machines will be used; that is, each machine 
will handle eight pieces of material from eight different batches, each piece of 
material being treated differently (by one of the 2 5 treatment combinations). In 
order to eliminate batch-to-batch variation we might consider each batch as a 
block of size 4. Elimination of possible systematic differences among machines 
suggests to consider each machine as a block of size 8. Combining these two 
types of blocking and their corresponding systems of confounding leads to an 
arrangement of the treatment combinations in a rectangular array that is referred 
to as a system of double confounding. For the present example this is illustrated 
in Table 9.12. 

This arrangement is obtained by confounding ABC, CDE, ABDE with rows 
(machines) and AB ， CD, ABCD, BDE ， ADE, BCE, ACE with columns 
(batches) (see Table 8.3) and then constructing the intrablock subgroup for each 
system of confounding. These subgroups represent the first row and first col¬ 
umn, respectively, in Table 9.12. The remaining treatment combinations are then 
filled in by taking products of appropriate treatment combinations according 
to the usual rules. In the actual experiment rows and columns are, of course, 
randomized. 


Table 9.12 2 5 Experiment in 4 x 8 Pattern 


Batches 



1 

2 

3 

4 

5 

6 

7 

8 

1 

(1) 

ab 

ace 

bee 

abde 

de 

bed 

acd 

. 2 

abe 

e 

be 

ac 

d 

abd 

aede 

bede 

Machines ^ 

cde 

abede 

ad 

bd 

abc 

c 

be 

ae 

4 

abed 

cd 

bde 

ade 

ce 

abce 

a 

b 
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Table 9.13 Analysis of Variance 
for Double Confounding Scheme 


Source of Variation 

d.f. 

Replicates 

1 

Rows/replicates 

6 

Columns/replicates 

14 

A, B, ABjj, C, AC\, BC, 


D, AD, BD, CD\\, E, AE, 

15 

BEi, CE, DE 


Residual 

27 


The observation in the jth row and kth column receiving the £th treatment is 
represented by the model 

yjkt = M + P ； + n + ^ + eju (9.36) 

where the terms have the obvious meaning. However, this particular design is of 
little value unless repeated, for the only means of estimating the error variance 
is to use unconfounded interactions involving three or more factors, so that only 
8 d.f. are available. In replications it would be well to use different schemes 
of confounding (i.e., partial confounding) so that some within-row-and-column 
information could be obtained on all important effects and interactions. If different 
sets of machines are used in different replicates, model (9.36) will be extended 
as follows: 


yijki = /X + a/ + Pij + Yik + Q + eijki (9.37) 

where at is the effect of the ith replicate, pij the effect of the jth row (machine) 
in the /th replicate, yik the effect of the 众 th column (batch) in the ith replicate, 
the effect of the ith treatment combination, and etjki the error associated with the 
unit in the jth row and 她 column of the ith replicate receiving the ith treatment. 
The structure of the ANOVA for model (9.37) is exhibited in Table 9.13 using 
two replicates and confounding in the second replicate ACD, BCE，ABDE 
with rows, AC, BE, ABCE, BCD, ABD, CDE, ADE with columns. We 
assume that interactions involving three or more factors are negligible. The d.f. 
for error in Table 9.13 can be broken down into 12 d.f. from unconfounded 
or partially confounded higher order interactions and 15 d.f. from comparisons 
of unconfounded effects and interactions in the two replicates, for example, A\ 
versus An. 


9.9 CONFOUNDING IN SQUARES 

In the previous section we have discussed systems of confounding in a rectangular 
array utilizing in general columns as blocks of size 2 〃 and rows as blocks of size 
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2 n ~ p . A special case, p = n!l ，would lead to a square array. More generally, 
however, confounding in squares refers to any type of square array with blocks 
of size 2 P in both directions. Then every treatment combination appears 2 2p ~ n 
times in the array. We shall consider a few specific examples illustrating the 
procedure (Kempthome, 1952, Chapter 15). 

9.9.1 2 3 Factorial in Two 4x4 Squares 

We need to confound one interaction with columns and one interaction with rows, 
and since two replicates are necessary to get an adequate estimate of the error 
variance, we may completely confound these two interactions, or we may partially 
confound four interactions, or we may partially confound two interactions (say, 
two 2-factor interactions) and completely confound one interaction (say, the 3- 
factor interaction). Examples of each follow. 

1. If ABC is confounded with columns and BC with rows, both replicates will 
be randomizations of one of the squares 


(1) 

a 

be 

abc 

(i) 

a 

be 

abc 

ab 

b 

ac 

c 

ab 

c 

ac 

b 

ac 

c 

ab 

b 

or 

ac 

b 

be 

c 

be 

abc 

(1) 

a 

be 

abc 

(1) 

a 


Note that the first square is constructed by the method described in Section 
9.8. Also, the ANOVA has a structure similar to that given in Table 9.13. 

2. If ABC and BC are confounded with columns and rows, respectively, in one 
square, and AB and AC are confounded with columns and rows, respectively, 
in the second square, we obtain the following result: 


(i) 

a 

be 

abc 

(1) 

a 

abc 

ac 

ab 

b 

ac 

c 

.ab 
and , 
abc 

b 

c 

be 

ac 

c 

ab 

b 

be 

(1) 

b 

be 

abc 

(1) 

a 

c 

ac 

ab 

a 


Here we obtain full information on main effects and ^ relative information 
on interactions. As a result, however, the error d.f. are reduced from 13 to 
11 compared to the previous design. 

3. If we confound ABC with columns in both squares, BC with rows in one 
square, and AB with rows in the second square, we obtain 


⑴ 

a 

be 

abc 

a) 

c 

ab 

abc 

ab 

b 

ac 

c 

.ab 
and 

ac 

abc 

(1) 

c 

ac 

c 

ab 

b 

a 

be 

b 

be 

abc 

(1) 

a 

be 

b 

ac 

a 
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9.9.2 2 4 Factorial in 8 x 8 Squares 

An interesting alternative to completely confounding two interactions with rows 
and columns is the following: Partially confound four 3-factor interactions with 
row pairs and completely confound the 4-factor interaction with the four column 
pairs. This leads to the following design, where ABC, ABD, ACD, BCD are 
confounded with row pairs I, II, III, IV, respectively: 


(1) 

d 

ab 

abd 

ac 

acd 

be 

bed 

ad 

a 

bd 

b 

cd 

abc 

abed 

c 

ab 

c 

(1) 

abc 

ad 

bed 

bd 

acd 

ac 

b 

be 

a 

abed 

abd 

cd 

d 

cd 

abc 

ac 

bed 

(1) 

b 

ad 

abd 

be 

acd 

abed 

c 

bd 

d 

ab 

a 

bd 

abd 

cd 

acd 

be 

a 

(1) 

abc 

abed 

bed 

ad 

d 

ab 

c 

ac 

b 


This design will provide | relative information on the 3-factor interactions 
ABC, ABD ， ACD, and BCD, no information on ABCD, and full information 
on main effects and 2-factor interactions. If replications can be done, other sys¬ 
tems of partial confounding may be used. The structure of the ANOVA is similar 
to that given in Table 9.13. 

Yates (1937a) has given arrangements for the 2 5 and 2 6 experiment in 8 x 8 
squares. Through the process of randomizing rows and columns, these arrange¬ 
ments may lead (with considerable frequency) to undesirable configurations. 
Grundy and Healy (1950) have provided solutions to this problem by using the 
concept of restricted randomization. 


9.10 NUMERICAL EXAMPLES USING SAS 

In this section we shall illustrate the use of PROC FACTEX to construct designs 
with partial confounding and PROC GLM and PROC MIXED to analyze data 
from such designs. 

9.10.1 2 3 Factorial in Blocks of Size 2 

Using one replicate each of types III, IV, and V as defined in Section 9.5.1 leads 
to a design with partial confounding of A, B, C, AB, AC, BC and complete 
confounding of ABC. The SAS input and output are given in Table 9.14. 
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Table 9.14 2 3 Factorial in Blocks of Size 2 Using Partial Confounding 


options nodate pageno=l ； 

proc factex ； 

factors ABC; 

blocks size=2 ； 

model est=(A B A*C B*C); 

examine design confounding ； 

output out=replicatel blockname=block nvals=(1 2 3 4); 
titlel 'TABLE 9.14'; 

title2 '2**3 FACTORIAL IN BLOCKS OF SIZE 2'; 

title3 'USING PARTIAL CONFOUNDING 1 ; 

title4 '(CONFOUNDING ABC, AB AND C IN REP 1)'; 

run ； 

proc factex ； 

factors ABC; 

blocks size=2 ； 

model est=(A C A*B B*C); 

examine design confounding ； 

output out=replicate2 blockname=block nvals=(5 6 7 8); 

title4 '(CONFOUNDING ABC, AC AND B IN REP 2'; 

run ； 

proc factex ； 

factors ABC; 

blocks size=2 ； 

model est=(B C A*B A*C); 

examine design confounding ； 

output out=replicate3 blockname=block nvals=(9 10 11 12); 

title4 '(CONFOUNDING ABC, BC, AND A IN REP 3)'; 

run ； 

data combine ； 

set replicatel replicate2 replicate3 ； 
run ； 

proc print data=combine ； 

title4 '(PARTIALLY CONFOUNDING A, B, AB, C AC, BC'; 

title5 'AND COMPLETELY CONFOUNDING ABC) 1 ; 

run ； 
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Table 9.14 (Continued) 


2**3 FACTORIAL IN BLOCKS OF SIZE 2 
USING PARTIAL CONFOUNDING 
(CONFOUNDING ABC, AB AND C IN REP 1) 


The FACTEX Procedure 
Design Points 


Experiment 


Number 

A 

B 

C 

Block 

1 

-1 

-1 

-1 

1 

2 

-1 

-1 

1 

4 

3 

-1 

1 

-1 

2 

4 

-1 

1 

1 

3 

5 

1 

-1 

-1 

2 

6 

1 

-1 

1 

3 

7 

1 

1 

-1 

1 

8 

1 

1 

1 

4 


Block Pseudofactor Confounding Rules 

[Bl] = A*B*C 
[B2] = C 


2**3 FACTORIAL IN BLOCKS OF SIZE 2 
USING PARTIAL CONFOUNDING 
(CONFOUNDING ABC, AC AND B IN REP 2) 

Design Points 


Experiment 

Number ABC Block 


1 -1 

2 -1 

3 -1 

4 -1 

5 1 

6 1 

7 1 

8 1 



1 

-1 

1 

-1 

1 


3 
2 

4 
1 
2 

3 
1 

4 


Block Pseudofactor Confounding Rules 

[Bl] = A*B*C 
[B2] = A*C 
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Table 9.14 


(Continued) 


2**3 FACTORIAL IN BLOCKS OF SIZE 2 
USING PARTIAL CONFOUNDING 
(CONFOUNDING ABC, BC, AND A IN REP 3) 

Design Points 

Experiment 

Number ABC Block 


1 

2 

3 

4 

5 

6 

7 

8 





3 

2 

2 

3 

4 
1 
1 
4 


Block Pseudofactor Confounding Rules 

[Bl] = A*B*C 
[B2] = B*C 

2**3 FACTORIAL IN BLOCKS OF SIZE 2 
USING PARTIAL CONFOUNDING 
(PARTIALLY CONFOUNDING A, B, AB, C, AC, BC 
AND COMPLETELY CONFOUNDING ABC) 


Obs 


block 


A 


B 


C 
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Note that there are three input statements, one for each replicate, and that 
the ESTIMATE statement specifies which effects should be estimable and hence 
which effects should be confounded in a given replicate. 

Various aspects of the analysis of data from the design of Table 9.14 are 
illustrated in Table 9.15 

The intrablock analysis is performed with PROC GLM and the combined anal¬ 
ysis with PROC MIXED. The output is fairly self-explanatory, but we shall make 
the following comments, mainly to relate the numerical output to the theoretical 
developments in this chapter: 


1. For the intrablock analysis, that is, the GLM procedure, estimates of all 
effects are obtained from the replicates in which they are not confounded 
with blocks. For example, the main effect A is estimated from replicates 1 
and 2, that is, blocks 1-8, yielding A = 3.0. 

2. The standard error for A is obtained from 

var(A) = — = - - 7.5833 = 1.8958 
2. 2 3-2 4 

as 


se(A) = = 1.3769 


3. Concerning the combined analysis using PROC MIXED, we notice that the 
estimate for obtained here, namely = 8.0297, is different from the 
estimate obtained from PROC GLM. The reason for this is that and 
are estimated jointly using the REML procedure (see Section 1.11.2) rather 
than the ANOVA procedure as outlined in Section 9.7. 

4. With = 8.027 and = 2.4656 the weights w and w r (see Section 9.7.5) 
are estimated as 


w = — = .1245 


w r 


1 


+ 25 j 


12.96 


.0772 


5. To illustrate the combining of intra- and interblock information, we consider 
estimation of A. From the intrablock analysis we have Am = 3.0. From 
replicate 3 we find Am = 5.50. Then [see (9.26)] 


A = 


2 x .1245 x 3.0 + .0772 x 5.50 
2 x .1245+ .0772 


= 3.5916 


which is (apart from rounding error) the value given in the output. 
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Table 9.15 Analysis of 2 3 Factorial With Partial Confounding 


options nodate pageno=l ； 
data confound ； 
input rep block A B C y @@； 
datalines ； 

110008 25011 22 39101 39 

11110 13 25110 25 39110 32 

12010 15 26001 20 3 10 0 0 1 28 

121009 26100 18 3 10 0 1 0 26 

13011 11 27000 20 3 11 0 0 0 30 
13101 18 27101 21 3 11 011 33 
140017 28010 17 3 12 1 0 0 27 

14111 19 28111 21 3 12 111 41 

/ 

run ； 

proc print data=confound ； 
titlel 'TABLE 9.15'; 

title2 '2**3 FACTORIAL WITH PARTIAL CONFOUNDING'; 
run ； 


proc glm data=confound; 

class rep block ABC; 

model y= rep block(rep) A|B|C@2 ； 

estimate 'Main effect A' A -1 1 ； 

estimate 'Interaction AB' A*B 1 -1 -1 l/divisor=2 ； 

title3 'INTRA-BLOCK ANALYSIS'; 

run ； 

proc mixed data=confound ； 

class rep block A B C ； 

model y= rep A|B|C/ddfm=satterth; 

random block(rep )； 

estimate 'Main effect A' A -1 1; 

estimate 'Interaction AB 1 A*B 1 -1 -1 l/divisor=2 ； 

estimate 'Interaction ABC' A*B*C 1 -1 -1 1 -1 1 1 -l/divisor=4 ； 

title3 'COMBINED INTRA- AND 工 NTER-BLOCK ANALYSIS 1 ; 

run ； 


proc glm data=confound; 
class rep block ABC; 
model y= rep A|B|C block(rep); 
title3 'AUXILIARY ANALYSIS 1 ; 
run ； 
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Table 9.15 (Continued) 


Obs 

rep 

block 

A 

B 

c 

y 

1 

1 

1 

0 

0 

0 

8 

2 

2 

5 

0 

1 

1 

22 

3 

3 

9 

1 

0 

1 

39 

4 

1 

1 

1 

1 

0 

13 

5 

2 

5 

1 

1 

0 

25 

6 

3 

9 

1 

1 

0 

32 

7 

1 

2 

0 

1 

0 

15 

8 

2 

6 

0 

0 

1 

20 

9 

3 

10 

0 

0 

1 

28 

10 

1 

2 

1 

0 

0 

9 

11 

2 

6 

1 

0 

0 

18 

12 

3 

10 

0 

1 

0 

26 

13 

1 

3 

0 

1 

1 

11 

14 

2 

7 

0 

0 

0 

20 

15 

3 

11 

0 

0 

0 

30 

16 

1 

3 

1 

0 

1 

18 

17 

2 

7 

1 

0 

1 

21 

18 

3 

11 

0 

1 

1 

33 

19 

1 

4 

0 

0 

1 

7 

20 

2 

8 

0 

1 

0 

17 

21 

3 

12 

1 

0 

0 

27 

22 

1 

4 

1 

1 

1 

19 

23 

2 

8 

1 

1 

1 

21 

24 

3 

12 

1 

1 

1 

41 


INTRABLOCK ANALYSIS 

The GLM Procedure 

Class Level Information 

Class Levels Values 

rep 3 123 

block 12 123456789 10 11 12 

A 2 0 1 

B 2 0 1 

0 1 


C 


2 
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Table 9.15 (Continued) 


Number of observations 24 


Dependent Variable : y 




Sum of 




Source 

DF 

Squares 

Mean Square 

F Value 

Pr > F 

Model 

17 

1889.833333 

111.166667 

14.66 

0.0016 

Error 

6 

45.500000 

7.583333 



Corrected Total 

23 

1935.333333 




R-Square 


Coeff Var 

Root MSE 

y Mean 


0.976490 


12.70978 

2.753785 

21.66667 



Source 

DF 

Type I SS 

Mean Square 

F Value 

Pr > F 

rep 

2 

1537.333333 

768.666667 

101.36 

<.0001 

block(rep) 

9 

127.000000 

14.111111 

1.86 

0.2316 

A 

1 

36.000000 

36.000000 

4.75 

0.0722 

B 

1 

36.000000 

36.000000 

4.75 

0.0722 

A*B 

1 

12.250000 

12.250000 

1.62 

0.2508 

C 

A*C 

1 

56.250000 

56.250000 

7.42 

0.0345 

1 

81.000000 

81.000000 

10.68 

0.0171 

B*C 

1 

4.000000 

4.000000 

0.53 

0.4950 


Source 

DF 

Type III SS 

Mean Square 

F Value 

Pr > F 

rep 

2 

1537.333333 

768.666667 

101.36 

<.0001 

block(rep) 

9 

119.833333 

13.314815 

1.76 

0.2539 

A 

1 

36.000000 

36.000000 

4.75 

0.0722 

B 

1 

36.000000 

36.000000 

4.75 

0.0722 

A*B 

1 

12.250000 

12.250000 

1.62 

0.2508 

C 

1 

56.250000 

56.250000 

7.42 

0.0345 

A*C 

1 

81.000000 

81.000000 

10.68 

0.0171 

B*C 

1 

4.000000 

4.000000 

0.53 

0.4950 


Parameter 

Estimate 

Standard 

Error 

t Value 

Pr > |t| 

Main effect A 

3.00000000 

1.37689264 

2.18 

0.0722 

Interaction AB 

1.75000000 

1.37689264 

1.27 

0.2508 
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Table 9.15 (Continued) 

2**3 FACTORIAL WITH PARTIAL CONFOUNDING 
COMBINED 工 NTRA- AND INTERBLOCK ANALYSIS 

The Mixed Procedure 

Model Information 

Data Set WORK.CONFOUND 

Dependent Variable y 

Covariance Structure Variance Components 

Estimation Method REML 

Residual Variance Method Profile 

Fixed Effects SE Method Model-Based 

Degrees of Freedom Method Satterthwaite 


Class Level Information 


Class 

Levels 

Values 

rep 

3 

1 

2 

3 

block 

12 

1 

2 

3 4 5 6 7 8 9 10 11 12 

A 

2 

0 

1 


B 

2 

0 

1 


C 

2 

0 

1 



Dimensions 

Covariance Parameters 
Columns in X 
Columns in Z 
Subjects 

Max Obs Per Subject 
Observations Used 
Observations Not Used 
Total Observations 


Iteration History 


Iteration 

Evaluations 

-2 Res Log Like 

Criterion 

0 

1 

83.94834175 


1 

2 

83.76036493 

0.00000969 

2 

1 

83.76008026 

0.00000000 


2 

30 

12 

1 

24 

24 

0 

24 




NUMERICAL EXAMPLES USING SAS 


347 


Table 9.15 (Continued) 


Convergence criteria met. 

Covariance Parameter 
Estimates 


Cov Parm 


block(rep) 
Residual 


Estimate 

2.4656 
8.0297 



Fit Statistics 


—2 Res Log Likelihood 

83.8 

AIC 

(smaller is better) 

87.8 

AICC 

(smaller is better) 

88.9 

BIC 

(smaller is better) 

88.7 


Type 3 Tests of Fixed Effects 


Effect 

Num DF 

Den DF 

F Value 

Pr > F 

rep 

2 

4.56 

59.31 

0.0005 

A 

1 

9.98 

8.41 

0.0158 

B 

1 

9.98 

4.57 

0.0584 

A*B 

1 

9.98 

0.63 

0.4464 

C 

A*C 

1 

9.98 

7.79 

0.0191 

1 

9.98 

6.19 

0.0321 

B*C 

1 

9.98 

0.11 

0.7481 

A*B*C 

1 

4.56 

1.85 

0.2370 


Estimates 


Standard 


Label 


Estimate 

Error 

DF 

t Value 

Pr > |t| 

Main effect 

A 

3.5913 

1.2380 

9.98 

2.90 

0.0158 

Interaction 

AB 

0.9814 

1.2380 

9.98 

0.79 

0.4464 

Interaction 

ABC 

2.0000 

1.4697 

4.56 

1.36 

0.2370 
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Table 9.15 (Continued) 


2**3 FACTORIAL WITH PARTIAL CONFOUNDING 
AUXILIARY ANALYSIS 


The GLM Procedure 


Class Level Information 


Class Levels Values 


rep 

block 

A 

B 

C 


3 12 3 

12 123456789 10 11 12 

2 0 1 

2 0 1 

2 0 1 


Number of observations 24 


Dependent Variable : y 




Sum of 




Source 

DF 

Squares 

Mean Square 

F Value 

Pr > F 

Model 

17 

1889.833333 

111.166667 

14.66 

0.0016 

Error 

6 

45.500000 

7.583333 



Corrected Total 

23 

1935.333333 





R-Square 

Coeff Var 

Root MSE 

y Mean 

0.976490 

12.70978 

2.753785 

21.66667 


Source 

DF 

Type I SS 

Mean Square 

F Value 

Pr > F 

rep 

2 

1537.333333 

768.666667 

101.36 

< . 0001 

A 

1 

88.166667 

88.166667 

11.63 

0.0143 

B 

1 

37.500000 

37.500000 

4.95 

0.0678 

A*B 

1 

2.666667 

2.666667 

0.35 

0.5748 

C 

1 

66.666667 

66.666667 

8.79 

0.0251 

A*C 

1 

37.500000 

37.500000 

4.95 

0.0678 

B*C 

1 

0.166667 

0.166667 

0.02 

0.8870 

A*B*C 

1 

24.000000 

24.000000 

3.16 

0.1256 

block(rep) 

8 

95.833333 

11.979167 

1.58 

0.2973 
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Table 9.15 (Continued) 


Source 

DF 

Type III SS 

Mean Square 

F Value 

Pr > F 

rep 

2 

1537.333333 

768.666667 

101.36 

<.0001 

A 

1 

36.000000 

36.000000 

4.75 

0.0722 

B 

1 

36.000000 

36.000000 

4.75 

0.0722 

A*B 

1 

12.250000 

12.250000 

1.62 

0.2508 

C 

1 

56.250000 

56.250000 

7.42 

0.0345 

A*C 

1 

81.000000 

81.000000 

10.68 

0.0171 

B*C 

1 

4.000000 

4.000000 

0.53 

0.4950 

A*B*C 

0 

0.000000 




block(rep) 

8 

95.833333 

11.979167 

1.58 

0.2973 


Using (9.27)，we obtain 


var (A , 


8.0297 

(2 + 0.6201)2 


1.5323 


and hence 


se 



=1.2379 


which is (apart from rounding error) the value given in the output. 
6 . Using all three replicates, we find 

ABC = 2.0 


with 


and 


var [ABC^j = 


W = 2. 16 


se (XBC^j = 1.4697 


7. We use the results of Table 9.15 — auxiliary analysis — to illustrate the numer¬ 
ical implementation of some of the results of Sections 9.7.4 and 9.7.5 The 
reader will recognize that the type I SS are those of the B|T-ANOVA in 
Table 9.11. Its main purpose is the estimation of w and w f . Obviously, 


^ 1 

w = - 

7.5833 


=.1319 
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and with A = 0.75 in (9.34)，we obtain 

w' = (-0.3333 X 7.5833 + 1.3333 x 11.9792)- 1 = 0.0744. 
The values for w and w f are used to obtain, for example, 

0.1319 x 3.0 + 0.0744 x 5.5 


1 0.1319 + 0.0744 

[see (9.26) and from (9.27)], 

7.5833 


3.9016 


var ^4J 
s,(a) 


(2 + 0.5641)2 
1.2160 


1.4787 


which is in good agreement with the results obtained in 5. 


9.10.2 2 4 Factorial in Blocks of Size 4 

Assuming that the 4-factor into action ABCD is negligible and that we want full 
information on all main effects, using eight blocks of size 4, a suitable design 
(from inspection of Table 8.3) is to confound AB and CD and, hence, ABCD in 
one replicate and AC and BD and, hence, ABCD in the other replicate. Such a 
design is given in Table 9.16, generated by PROC FACTEX. This design results 
in the following three sets (see Section 9.7): 

£i = {ABCD} n\ = 1 

£ 2 = {AB, AC, BD, CD} n 2 = 4 

£3 = [A, B, C, D, AD, BC, ABC, ABD, ACD, BCD] n 3 = 10 

with zero information on £ 1 , ^ information on £ 2 , and full information on S 3 . 

The analysis of a data set using the design of Table 9.16 is given in Table 9.17 
using PROC GLM for intrablock analysis and PROC MIXED for the combined 
analysis. 

The same comments made in Section 9.10.1 apply here also, and the reader is 
encouraged to perform the computations using previously described methodology 
in order to verify agreement with the computer output. We only make one addi¬ 
tional comment concerning unconfounded effects, for example, main effect A. 
Since A is unconfounded, only intrablock information exists. This is reflected in 
the fact that the estimates for A are the same, namely A = 4.6250, in the PROC 
GLM and PROC MIXED outputs. However, the standard errors are slightly dif¬ 
ferent, 0.8705 for PROC GLM versus 0.8722 for PROC MIXED, due to the fact 
that the estimation procedures for variance components and hence the estimates 
are different (due to unbalancedness) for the two procedures, = 6.0625 for 
PROC GLM and o ^ 2 = 6.0857 for PROC MIXED. The difference in this case is, 
of course, negligible. 
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Table 9.16 2 4 Factorial in Blocks of Size 4 Using Partial Confounding 


options nodate pageno=l ； 
proc factex ； 
factors A B C D ； 
blocks size=4 ； 

model est=(A B C A*C B*C D A*D B*D A*B*C A*B*D A*C*D B*C*D); 
examine design confounding ； 

output out=replicatel blockname=block nvals=(1 2 3 4); 
titlel 'TABLE 9.16'; 

title2 '2**4 FACTORIAL IN BLOCKS OF SIZE 4'; 

title3 'USING PARTIAL CONFOUNDING 1 ; 

run ； 

proc factex ； 
factors A B C D ； 
blocks size=4 ； 

model est=(A B A*B C B*C A*B*C D A*D C*D A*B*D A*C*D B*C*D); 
examine design confounding ； 

output out=replicate2 blockname=block nvals=(5 6 7 8); 
run ； 

proc print data=replicatel ； 

title3 '(PARTIALLY CONFOUNDING AB AC BD CD 1 ; 
title4 'AND COMPLETELY CONFOUNDING ABCD)'; 
run ； 

proc print data=replicate2 ； 
run ； 

The FACTEX Procedure 
Design Points 

Experiment 


Number 

A 

B 

C 

D 

Block 

1 

-1 

-1 

-1 

-1 

4 

2 

-1 

-1 

-1 

1 

1 

3 

-1 

-1 

1 

-1 

1 

4 

-1 

-1 

1 

1 

4 

5 

-1 

1 

-1 

-1 

3 

6 

-1 

1 

-1 

1 

2 

7 

-1 

1 

1 

-1 

2 

8 

-1 

1 

1 

1 

3 

9 

1 

-1 

-1 

-1 

3 

10 

1 

-1 

-1 

1 

2 

11 

1 

-1 

1 

-1 

2 

12 

1 

-1 

1 

1 

3 

13 

1 

1 

-1 

-1 

4 
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Table 9.16 (Continued) 


14 1 1 -1 

15 1 1 1 

16 1 1 1 


1 

1 

4 


Block Pseudo-factor Confounding Rules 

[Bl] = A*B*C*D 
[B2] = C*D 

Design Points 

Experiment 

Number A B C D Block 


1 

2 

3 

4 

5 

6 

7 

8 
9 

10 

11 

12 

13 

14 

15 

16 



4 

1 

3 
2 
1 

4 
2 
3 

3 
2 

4 
1 
2 

3 
1 

4 


Block Pseudo-factor Confounding Rules 

[Bl] = A*B*C*D 
[B2] = B*D 


2**4 FACTORIAL IN BLOCKS OF SIZE 4 
(PARTIALLY CONFOUNDING AB AC BD CD 
AND COMPLETELY CONFOUNDING ABCD) 


Obs 

block 

A 

B 

C 

D 

1 

1 

-1 

-1 

-1 

1 

2 

1 

-1 

-1 

1 

-1 

3 

1 

1 

1 

-1 

1 

4 

1 

1 

1 

1 

-1 

5 

2 

-1 

1 

-1 

1 

6 

2 

-1 

1 

1 

-1 
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Table 9.16 (Continued) 



Table 9.17 


Analysis of 2 4 Factorial With Partial Confounding 

options 

nodate 

pageno=l ； 




data partial; 








input 

rep 

block 

: A 

B 

C 

D 

y 

@@; 

datalines ； 









11 

0 

0 

0 

1 

15 

2 

5 

0 

0 

0 

1 

40 

11 

0 

0 

1 

0 

19 

2 

5 

0 

1 

0 

0 

43 

11 

1 

1 

0 

1 

12 

2 

5 

1 

0 

l 

1 

47 

11 

1 

1 

1 

0 

22 

2 

5 

1 

1 

l 

0 

45 

12 

0 

1 

0 

1 

21 

2 

6 

0 

0 

l 

1 

45 

12 

0 

1 

1 

0 

25 

2 

6 

0 

1 

l 

0 

49 

12 

1 

0 

0 

1 

28 

2 

6 

1 

0 

0 

1 

52 

12 

1 

0 

1 

0 

30 

2 

6 

1 

1 

0 

0 

53 

13 

0 

1 

0 

0 

27 

2 

7 

0 

0 

1 

0 

55 

13 

0 

1 

1 

1 

31 

2 

7 

0 

1 

1 

1 

59 

13 

1 

0 

0 

0 

30 

2 

7 

1 

0 

0 

0 

54 

13 

1 

0 

1 

1 

34 

2 

7 

1 

1 

0 

1 

58 
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Table 9.17 (Continued) 


1 4 0 0 0 0 14 
1 4 0 0 1 1 20 
1 4 1 1 0 0 22 
1 4 1 1 1 1 27 


2 8 0 0 0 0 45 
2 8 0 1 0 1 50 
2 8 1 0 1 0 55 
2 8 1 1 1 1 63 


proc print data=partial ； 
titlel 'TABLE 9.17'; 

title2 '2**4 FACTORIAL WITH PARTIAL CONFOUNDING'; 
run ； 


proc glm data=partial ； 
class rep block A B C D ； 
model y=rep block(rep)A|B|C|D@3; 
estimate 1 Main effect A 1 A -1 1 ； 

estimate 1 Interaction AB' A*B 1 -1 -1 l/divisor=2 ； 

title3 'INTRA-BLOCK ANALYSIS'; 

run ； 


proc mixed data=partial ； 

class rep block A B C D ； 

model y= rep A|B|C|D/ddfm=satterth; 

random block(rep )； 

estimate 1 Main effect A' A -1 1 ； 

estimate 1 Interaction AB' A*B 1 -1 -1 l/divisor=2 ； 

title3 'COMBINED 工 NTRA- AND INTER-BLOCK ANALYSIS 1 ; 
run ； 


Obs 

rep 

block 

A 

B 

c 

D 

y 

1 

1 

1 

0 

0 

0 

1 

15 

2 

1 

1 

0 

0 

1 

0 

19 

3 

1 

1 

1 

1 

0 

1 

12 

4 

1 

1 

1 

1 

1 

0 

22 

5 

1 

2 

0 

1 

0 

1 

21 

6 

1 

2 

0 

1 

1 

0 

25 

7 

1 

2 

1 

0 

0 

1 

28 

8 

1 

2 

1 

0 

1 

0 

30 

9 

1 

3 

0 

1 

0 

0 

27 

10 

1 

3 

0 

1 

1 

1 

31 

11 

1 

3 

1 

0 

0 

0 

30 

12 

1 

3 

1 

0 

1 

1 

34 

13 

1 

4 

0 

0 

0 

0 

14 

14 

1 

4 

0 

0 

1 

1 

20 

15 

1 

4 

1 

1 

0 

0 

22 
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Table 9.17 (Continued) 


16 

1 4 

1 

1 

1 

1 

27 

17 

2 5 0 

0 

0 

1 

40 

18 

2 5 0 

1 

0 

0 

43 

19 

2 5 1 

0 

1 

1 

47 

20 

2 5 1 

1 

1 

0 

45 

21 

2 6 

0 

0 

1 

1 

45 

22 

2 6 

0 

1 

1 

0 

49 

23 

2 6 

1 

0 

0 

1 

52 

24 

2 6 

1 

1 

0 

0 

53 

25 

2 7 0 

0 

1 

0 

55 

26 

2 7 0 

1 

1 

1 

59 

27 

2 7 1 

0 

0 

0 

54 

28 

2 7 1 

1 

0 

1 

58 

29 

2 8 

! 0 

0 

0 

0 

45 

30 

2 8 

! 0 

1 

0 

1 

50 

31 

2 8 

! 1 

0 

1 

0 

55 

32 

2 8 

1 1 

1 

1 

1 

63 


2**4 FACTORIAL 

WITH 

PARTIAL CONFOUNDING 



INTRABLOCK ANALYSIS 




The 

GLM : 

Procedure 





Class : 

Level 

Information 



Class 

Levels 

Values 




rep 


2 

12 




block 


8 

12 3 4 

5 6 

7 8 


A 

2 

0 

1 




B 

2 

0 

1 




C 

2 

0 

1 




D 

2 

0 

1 





Number of observations 32 



Dependent 

Variable : 

y 






Sum of 




Source 

DF 

Squares 

Mean Square 

F Value 

Pr > F 

Model 

21 

7132.250000 

339.630952 

56.02 

<.0001 

Error 

10 

60.625000 

6.062500 



Corrected 

Total 31 

7192.875000 
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Table 9.17 (Continued) 


R-Square 

Coeff Var 

Root MSE 

y Mean 


0.991572 

6.621081 

2.462214 

37.18750 


Source 

DF 

Type 工 SS 

Mean Square 

F Value 

Pr > F 

rep 

1 

5940.500000 

5940.500000 

979.88 

< . 0001 

block(rep) 

6 

111 .375000 

129.562500 

21.37 

< . 0001 

A 

1 

171.125000 

171.125000 

28.23 

0.0003 

B 

1 

18.000000 

18.000000 

2.97 

0.1156 

A*B 

1 

1.562500 

1.562500 

0.26 

0.6227 

C 

A*C 

1 

120.125000 

120.125000 

19.81 

0.0012 

1 

0.562500 

0.562500 

0.09 

0.7669 

B*C 

1 

2.000000 

2.000000 

0.33 

0.5784 

A*B*C 

1 

4.500000 

4.500000 

0.74 

0.4091 

D 

1 

6.125000 

6.125000 

1.01 

0.3385 

A*D 

1 

1.125000 

1.125000 

0.19 

0.6758 

B*D 

1 

5.062500 

5.062500 

0.84 

0.3823 

A*B*D 

1 

0.500000 

0.500000 

0.08 

0.7798 

C*D 

1 

1.562500 

1.562500 

0.26 

0.6227 

A*C*D 

1 

10.125000 

10.125000 

1.67 

0.2253 

B*C*D 

1 

72.000000 

72.000000 

11.88 

0.0063 

Source 

DF 

Type III SS 

Mean Square 

F Value 

Pr > F 

rep 

1 

5940.500000 

5940.500000 

979.88 

< . 0001 

block(rep) 

6 

406.875000 

67.812500 

11.19 

0.0006 

A 

1 

171.125000 

171.125000 

28.23 

0.0003 

B 

1 

18.000000 

18.000000 

2.97 

0.1156 

A*B 

1 

1.562500 

1.562500 

0.26 

0.6227 

C 

1 

120.125000 

120.125000 

19.81 

0.0012 

A*C 

1 

0.562500 

0.562500 

0.09 

0.7669 

B*C 

1 

2.000000 

2.000000 

0.33 

0.5784 

A*B*C 

1 

4.500000 

4.500000 

0.74 

0.4091 

D 

1 

6.125000 

6.125000 

1.01 

0.3385 

A*D 

1 

1.125000 

1.125000 

0.19 

0.6758 

B*D 

1 

5.062500 

5.062500 

0.84 

0.3823 

A*B*D 

1 

0.500000 

0.500000 

0.08 

0.7798 

C*D 

1 

1.562500 

1.562500 

0.26 

0.6227 

A*C*D 

1 

10.125000 

10.125000 

1.67 

0.2253 

B*C*D 

1 

72.000000 

72.000000 

11.88 

0.0063 

Parameter 


Estimate Standard Error 

t Value Pr > |t| 

Main effect 

A 

4.62500000 

0.87052427 

5.31 

0.0003 

Interaction 

AB - 

■0.62500000 

1.23110723 

-0.51 

0.6227 
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Table 9.17 (Continued) 


2**4 FACTORIAL WITH PARTIAL CONFOUNDING 
COMBINED 工 NTRA- AND INTERBLOCK ANALYSIS 

The Mixed Procedure 

Model Information 

Data Set WORK.PARTIAL 

Dependent Variable y 

Covariance Structure Variance Components 

Estimation Method REML 

Residual Variance Method Profile 

Fixed Effects SE Method Model-Based 

Degrees of Freedom Method Satterthwaite 



Covariance Parameters 2 
Columns in X 83 
Columns in Z 8 
Subjects 1 
Max Obs Per Subject 32 
Observations Used 32 
Observations Not Used 0 
Total Observations 32 


Iteration History- 

Iteration Evaluations -2 Res Log Like Criterion 

0 1 107.26388086 

1 2 97.51296909 0.01846419 

2 1 96.69150852 0.00679576 

3 1 96.40491364 0.00138264 

4 1 96.35086623 0.00008239 

5 1 96.34791369 0.00000036 

96.34790114 0.00000000 


6 


1 
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Table 9.17 (Continued) 

Convergence criteria met. 
Covariance Parameter Estimates 


Cov Parm 


Estimate 


block(rep) 
Residual 


36.8465 
6.0857 


Fit Statistics 



-2 : 

Res Log Likelihood 

96.3 



AIC 

(smaller 

is better) 

100.3 



AICC (smaller 

is better) 

101.3 



BIC 

(smaller 

is better) 

100.5 




Type 

3 Tests of Fixed 

Effects 


Effect 


Num 

DF 

Den DF 

F Value 

Pr > F 

rep 



1 

4.62 

38.71 

0.0021 

A 



1 

9.92 

28.12 

0.0004 

B 



1 

9.92 

2.96 

0.1164 

A*B 



1 

10.7 

0.63 

0.4452 

C 



1 

9.92 

19.74 

0.0013 

A*C 



1 

10.7 

0.02 

0.8817 

B*C 



1 

9.92 

0.33 

0.5792 

A*B*C 



1 

9.92 

0.74 

0.4101 

D 



1 

9.92 

1.01 

0.3396 

A*D 



1 

9.92 

0.18 

0.6764 

B*D 



1 

10.7 

0.41 

0.5366 

A*B*D 



1 

9.92 

0.08 

0.7803 

C*D 



1 

10.7 

0.39 

0.5438 

A*C*D 



1 

9.92 

1.66 

0.2264 

B*C*D 



1 

9.92 

11.83 

0.0064 

A*B*C*D 


1 

4.62 

0.01 

0.9139 




Estimates 




Label 

Estimate 

Standard 

Error 

DF 

t Value 

Pr > |t| 

Main effect A 

Interaction AB 

4.6250 

-0.9587 

0.8722 

1.2097 

9.92 

10.7 

5.30 

-0.79 

0.0004 

0.4452 




CHAPTER 10 


Designs with Factors at Three Levels 


10.1 INTRODUCTION 

In discussing the 2 n factorial design in Chapter 7 we saw that main effects and 
interactions can be defined simply as linear combinations of the true responses, 
more specifically as the average response of one set of 2 n ~ l treatment combi¬ 
nations minus the average response of the complementary set of 2 n ~ l treatment 
combinations. And even more specifically, the main effect of a certain factor is 
the average response with that factor at the 1 level minus the average response 
with that factor at the 0 level. Turning now to the situation where each factor 
has three levels, which we shall refer to as 0 level, 1 level, and 2 level, such a 
simple definition of main effects and interactions no longer exists. We can no 
longer talk about the main effect of a factor or the interaction between two or 
more factors but shall talk instead about main effect components or comparisons 
belonging to a certain factor and about interaction components. We shall see how 
all this can be developed as a generalization of the formal approach described 
for the 2 n experiment in Section 7.4. 


10.2 DEFINITION OF MAIN EFFECTS AND INTERACTIONS 
10.2.1 The 3 2 Case 

To introduce the concepts we shall consider first the simplest case, namely that 
of two factors, A and B say, each having three levels, denoted by 0, 1, 2. A 
treatment combination of this 3 2 factorial is then represented by x f = (x\,X 2 ) 
where Xi =0, 1, 2( / = 1,2)，with x\ referring to factor A and X 2 to factor B. 
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We now partition the set of nine treatment combinations into three sets of 
three treatment combinations each according to the levels of factor A: 

set I: {(0,0), (0,1) ， (0, 2)} 
set II: {(1,0), (1,1), (1,2)} 
set III: {(2,0), (2, 1), (2, 2)} 

More formally, we can define these three sets by the three equations: 

set I: xi = 0 

set II: x\ = l (10.1) 

set III: x\ =2 

Comparisons among the mean true responses for these three sets are then said 
to belong to main effect A. Since there are three sets, there are two linearly 
independent comparisons among these three sets (i.e.，their mean responses), 
and these comparisons represent the 2 d.f. for main effect A. For example, the 
comparisons could be (set I — set II) and (set I — III), or (set I — set II) and 
(set I + set II _ 2 set III). 

Similarly, we can divide the nine treatment combinations into three sets corre¬ 
sponding to the levels of factor B or, equivalently, corresponding to the equations: 

x 2 = 0: {(0,0), (1,0), (2,0)} 

^2=1： {(0,1), (1,1), (2,1)} (10.2) 
^2 = 2: {(0,2), (1,2), (2,2)} 


Comparisons among the mean responses of these three sets then constitute main 
effect B. 

As in the 2 n case, the interaction between factors A and B will be defined in 
terms of comparisons of sets (of treatment combinations), which are determined 
by equations involving both x\ and X 2 - One such partitioning is given by 

set I: x\ X 2 = 0 mod 3: {(0,0), (1,2), (2,1)} 

set II: xi + x 2 = 1 mod 3: {(1,0), (0,1), (2,2)} (10.3) 

set III: x\ X 2 = 2 mod 3: {(2,0), (0,2), (1,1)} 


Comparisons among these three sets account for 2 of the 4 d.f. for the A x 5 
interaction. The remaining 2 d.f. are accounted for by comparisons among the 
sets based on the following partition: 


set I: x\ + 2x2 = 0 
set II: x\ + 2x2 = 1 
set III: x\ + 2x2 = 2 


mod 3: 
mod 3: 
mod 3: 


{(0,0), (1,1)，（2,2)} 
{(1,0), (0,2)，(2, 1)} 
{(2,0), (0,1)，(1,2)} 


(10.4) 
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To see what our development so far means with respect to the usual factorial 
representation, we consider (see also Section 1.11.8.1) 

Tij = At Bj -\- (AB)ij (10.5) 

with 

2 2 2 

E A; = 0 ^2 Bj =0 ^ AB )ij = 0 

i=0 7=0 / =0 

for each j, 

2 

7=0 

for each i, where r" is the true response for the treatment combination (x\ = 
i ， X 2 = j). With the model (10.5)，a contrast among sets (10.1)，that is, 

^ c{fi. = ^ CiAi (E q = 0) 
i i 

is the corresponding contrast among A main effects. A contrast among sets (10.2), 
that is, 

^2 c j T -j = ^2 c j B j c j = °) 

j j 

it is the corresponding contrast among B main effects. A contrast among sets 
(10.3) can be written as 

Cl(T00 + Tl2 + T21) + C2(ll0 + Toi + [22) + <^3(^20 + ^02 + ^ll) 

(C\ + C 2 + C 3 = 0) 

which, using (10.5), reduces to the same contrast in the (AB)ij 9 s. The same is 
true for comparisons among sets (10.4)，that is, 

Cl (Too + Til + T22) + C2(TlO + 『02 + ^2l) + <^3(^20 + ^01 + :12) 

{C\ + C 2 + C 3 = 0) 

The reader will notice that the last two comparisons have no particular meaning 
or interpretation for any choice of the q’s, except that they each belong to the 2- 
factor interaction A x B, and that each represents 2 d.f. of that interaction. This is 
in contrast to the parameterization given in Section 1.11.8.1 in terms of orthogonal 
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polynomials. One difference, of course, is that the parameterization given there is 
in terms of single degree of freedom parameters, and a second difference is that 
it is meaningful only for quantitative factors, whereas the definitions in terms of 
the partitions as summarized in (10.6) below hold for quantitative and qualitative 
factors. But the most important point is that the current definitions of effects and 
interactions will prove to be important in the context of systems of confounding 
(see Section 10.5) and fractional factorials (see Section 13.4). 

To sum up our discussion so far, the effects and interactions for a 3 2 experiment 
are given in pairs of degrees of freedom by comparisons among three sets of 
treatment combinations as follows: 


A: 

xi = 0, 1, 2 



B: 

^2 = 0, 1, 2 

mod 3 

(10.6) 


Jxi + X2 = 0, 1, 2 



A x B: 

[xi + 2x2 = 0, 1, 2 




It is convenient to denote the pairs of degrees of freedom corresponding to 
xi + X 2 = 0, 1, 2 by the symbol AB and the pair corresponding to x\ + 2x2 = 
0, 1,2 by AB 2 . 

It is easy to see that the groups given by the symbols AB 2 and A 2 B are 
the same. It is, therefore, convenient, in order to obtain a complete and unique 
enumeration of the pairs of degrees of freedom, to adopt the rule that an order 
of the letters is to be chosen in advance and that the power of the first letter in a 
symbol must be unity. This latter is obtained by taking the square of the symbol 
with the rule that the cube of any letter is to be replaced by unity, that is, if the 
initial letter of the symbol occurs raised to the power 2, for example, A 2 B, we 
then obtain 


A 2 B = (A 2 B) 2 = A 4 B 2 = AB 2 


This procedure follows from the fact that the partitioning produced by 

2x\ + X 2 = 0, 1,2 


is the same as that produced by 


that is, 


2 - 2xi + 2x2 = 0, 2, 1 


x\ + 2 x 2 = 0, 2, 1 


which is the partitioning denoted by AB 2 . 
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Table 10.1 

Effects and Interactions for 3 3 

Experiment 






Left-Hand Side of 

Effect/Interaction 

Defining Equation 

A 



xi 

B 



X2 

Ax B 

\ab 


x\ +X 2 


[ab- 


X\ + 2X2 

C 



叉 3 

AxC 

\ac 


X\ +X 3 


[ac : 


X\ + 2^3 

B x C ‘ 

\bc 


X2+X2, 

[BC Z 

X2 + 2^3 


1 

ABC 

X\-\-X2 + X3 

A x B x C 

ABC 2 

X\ X2~\~ 2X3 

AB 2 C 

X\ + 2x2 + X 3 


1 

ab 2 c 2 

X\ + 2x2 + 2^3 


10.2.2 General Case 

The procedure of formally defining effects and interactions, illustrated for the 3 2 
experiment, can be extended easily to the 3 〃 case. We shall have then (3 n — 1)/2 
symbols, each representing 2 d.f. For example, for the 3 3 experiment there will 
be 13 symbols as given in Table 10.1 together with their defining equations of 
the form a\x\ + + 0 ^ 3 x 3 =0, 1, 2 mod 3. 

For the general case of the 3 n experiment, denoting the factors by 
Ai, A2 ,..., A n , the (3 n — 1)/2 symbols can be written as A^ 1 , A^ 2 , , An n 
with oi[ = 0 , 1 , 2 (/ = and the convention that ( 1 ) any letter A[ with 

oti = 0 is dropped from the expression, ( 2 ) the first nonzero a is equal to one 
(this can always be achieved by multiplying each oti by 2), and (3) any oti = 1 
is not written explicitly in the expression. (This is illustrated in Table 10.1 by 
replacing A by A\, B by A2, and C by A3.) 

The ^-tuple a' = (ai, c^，• • • ， a n ) associated with A^ 1 A^ 2 - - - A^ n is referred 
to as a partition of the 3" treatment combinations into three sets according to 
the equations 


a\x\ + a ， 2 X 2 + • • • + oi n x n =0, 1,2 mod 3 (10.7) 



364 


DESIGNS WITH FACTORS AT THREE LEVELS 


We now list some properties of such partitions: 

1. Each partition leads to three sets of 3 W_1 treatment combinations each as 
is evident from Eqs. (10.7). 

2. If a' = (ofi, 0 ^ 2 , ... ， a n ) and 〆 =d 卩 2, … ， ^n) are two distinct parti¬ 
tions, then the two equations 


a\X\ + 012X2 + - • 

.+ ot n x n = 8\ mod 3 

(10.8) 

and 



PlXl + ^2X2 + - ■ 

.+ p n x n = 82 mod 3 

(10.9) 

are satisfied by exactly 3 n ~ 2 treatment combinations x' = (x\,X2, . 

.• ， ^n)• 


This implies that the set of treatment combinations determined by a r x = 
has exactly 3 n ~ 2 treatment combinations in common with each of the 
three sets determined by the equations p f x = 0, 1, 2 mod 3, respectively. 
It is in this sense that the two partitions a and P are orthogonal to 
each other. 

3 . If a treatment combination x f = (x\, X2,, x n ) satisfies both Eqs. 
(10.8) and (10.9) for a particular choice of 8\, 82, then x also satisfies 
the equation 

(a\ + P\)x\ + (a 2 + ^ 2)^2 H - h (a n + Pn)x n = +(5 2 mod 3 

( 10 . 10 ) 

Equation (10.10) is, of course, one of the three equations associated 
with the partition a’ + 〆 = {ot\ + ^ 1 , + ^ 2 , in which 

each component is reduced mod 3, and hence with the interaction 
Ap+h A; 2 # 2 . •. An n+ ^ n . In agreement with the definition in Section 7.4 
we refer to A ^ 2 " 1 "^ 2 - - - as generalized interaction 

(GI) of E a = A a x l A a 2 2 • • • A^ n and … 於 . 

In addition to satisfying (10.10)，the treatment combination x, which sat¬ 
isfies (10.8) and (10.9)，also satisfies the equation 

(ofi + 2 P\)x\ + (oi2 + 2^2 )-^2 + • • • + (oi n + 2 ^ n )x n = + 282 mod 3 

( 10 . 11 ) 

which is associated with the partition a! + 2 〆 and hence the interaction 
E a+2 ^ = ^ 2 + 2 ^ 2 ... A^ n+2 ^. This interaction is therefore another 

GI of E a and . To summarize then, any two interactions E a and 
have two GIs and E a+2 择 , where a + )8 and a + 2)8 are formed 

mod 3 and are subject to the rules stated earlier. We illustrate this by the 
following example. 
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Example 10.1 Consider AB and ABC 2 in the 3 3 case, that is, a’ = (1, 1,0) 
and P r = (1, 1,2). Then 

(a + py = (2, 2, 2)E(1,1 ， 1) 

and 

(a+ 2)8/= (3,3, 4)^(0, 0, 1) 

and hence the GIs of AB and ABC 2 are ABC and C. Another way of obtaining 
this result is through formal multiplication and reduction mod 3, that is, 

(AB)(ABC 2 ) = A 2 B 2 C 2 = (A 2 B 2 C 2 ) 2 = ABC 


and 

(AB)(ABC 2 ) 2 = A 3 B 3 C 4 = C □ 


10.3 PARAMETERIZATION IN TERMS OF MAIN EFFECTS 
AND INTERACTIONS 


The symbols used in the previous section to denote pairs of degrees of freedom 
will also be used to denote the magnitude of effects and interactions in the fol¬ 
lowing way (see also Section 10.4). Each symbol represents a division of the set 
of 3 n treatment combinations into three sets of 3 n — 1 treatment combinations 
each. The symbol, with a subscript that is the right-hand side of the equation 
determining the particular one of the three sets in which the treatment combina¬ 
tions lie, will denote the mean response of that set as a deviation from the overall 
mean, M. If E a = A°^ 1 A^ 2 - - - A^ n represents an interaction, then 


Er = (A^Af...^) i = 


(mean of treatment combinations 
\ satisfying a r x = i mod 3 


-M (10.12) 


We shall also use the notation £^, x for given a and x to denote one of the 
quantities E^, E^, depending on whether a f x =0, 1, 2 mod 3, respectively. 
We note that a comparison belonging to E a is, of course, given by 


coEq + c\E^ + C2E2 (c 0 + ci + c 2 = 0) (10.13) 

Also, it follows from (10.12) that 

£« + + £« = 0 (10.14) 


so that any comparison of the form (10.13) could be expressed in terms of only 
two Ef. Such a procedure was, in fact, adopted for the 2 〃 factorial, but as we 
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shall see below, in the present situation this would only lead to a certain amount 
of asymmetry. 

As an extension of (7.42) we can now state and prove the following result, 
which expresses the response a(x) of a treatment combination x as a linear 
combination of interaction components. This parameterization of a(x) is given by 

a(x) = M + E( 10 . 15 ) 

a 

where summation is over all a! = {a\, o^，. •. ， a n ) ^ (0,0,, 0), subject to 
the rule that the first nonzero at equals one, and a!x is reduced mod 3. The 
proof of (10.15) follows that of (7.42) and will be given for the general case in 
Section 11.5. 

We illustrate (10.15) with the following example. 

Example 10.2 Consider the 3 3 factorial with factors A, B，C and denote 
the true response of the treatment combination (/, j, k) by aibjCk. Then (10.15) 
can be written as 

ciibjCk = M -\- At -\- Bj + ABf +2 j 

+ Q + ACi^k + ACf + 2k + j-\-k 

+ BC )+2k + ^BC i+ j + k + ABCf_^j +2 k 
+ AB 2 Ci^2j-\-k + AB 2 C} +2 j +2k 
For i = l, j = l, k = 2, for example, this becomes 

dib\C2 = ^ ~h A.\ + 石 1 + A.B 2 ~h -|- C*2 ~h AC*o 
+ AC| + BCo- {- BCl + ABCi + ABC\ 

+ ab 2 c 2 + ab 2 c\ □ 

We emphasize again that the parameterization (10.15), which because 
of (10.14) is a non-full-rank parameterization, becomes important in connec¬ 
tion with systems of confounding (Section 10.5) and fractional factorials 
(Section 13.4). 

10.4 ANALYSIS OF 3 n EXPERIMENTS 

Suppose that each treatment combination is replicated r times in an appropriate 
error control design, such as a CRD or a RCBD. Comparisons of treatments are 
then achieved by simply comparing the observed treatment means, and tests for 
main effects and interactions are done in an appropriate ANOVA. 
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Table 10.2 ANOYA for 3 3 Experiment in Randomized Complete 
Block Design 


Source 

d.f. 


ss 

Blocks 

r - 1 

27 Ed 

-y....) 2 

Treatments 

3 3 - 1 = 26 



A 

2 

7=0 

-y....) 2 

B 

2 

9 土 (y.i 

-y....) 2 



k=0 


Ax B 

4 

3r H (y-jk- 

— y.j.. — y..k- 十歹 ….) 2 



jk 




2 


C 

2 

€-0 

-y....) 2 

AxC 

4 

3r J2 (y-j-i 

— y-j- ~ y-t + 歹 … .) 2 





B xC 

4 

3r (歹 

— y ： k- ~~ y-t + 歹 … .) 2 



ki 


A x B x C 

8 

r J2 (y-jke . 

— y.jk. — y-j-t ~ y-u + y-j- 



jkt 




+ 乂乂 . + y-t ~ y .…) 

Residual 

26(r - 1) 

XI ( 歹讲 - 

刃…一 y.jki 十歹 .…) 2 



ijki 



For purposes of illustration we consider a 3 3 experiment in an RCBD with r 
blocks. With the usual model 

yijkt = M + A. + ^jki ^ijkt 

or 

y = fiO + ^ + X T r + e 

where (Jkt) denotes the level combinations for the three factors A, B, C, and 
with the factorial structure of the treatments, we obtain the usual ANOVA given 
in Table 10.2. 

An alternative way of computing the various components of the treatment sum 
of squares is based upon the definition of the 2 -d.f. components of any interaction 
and the corresponding symbols defined in Section 10.3. Let E a denote any such 
interaction component, such as AB or AB 2 C, and let be the mean 
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Table 10.3 Partitioning of 3-Factor Interaction 
Sum of Squares 


Source 

d.f. 

SS 

ABC 

2 

9r j [ASCo] 2 + [ABC l f + [A5CS] 2 ) 

ABC 2 

2 

9r { [ABCl] 2 + [ABCj] 2 + [Xsc|] 2 } 

AB 2 C 

2 

9r{ LAS^CoJ 2 + [AB^Ci] 2 + [AB^C 2 f\ 

ab 2 c 2 

2 

9r { + [AB^Cj] 2 + 


observed responses (as a deviation from the overall mean) of the three sets 
defining E a . The sum of squares associated with E a ， accounting for 2 d.f.，is 
then given by 


SS(E a ) = rr- 1 {[玲] 2 + [E^f + [玲] 2 j (10.16a) 

with 

E [SS(£ a )] = 2o] + rT~ l j [ 印 ] 2 + [E a x f + [ 玲 ] 2 } (10.16b) 

Specifically, for the 3 3 experiment the sum of squares due to 3-factor interaction, 
for example, can be broken down as given in Table 10.3. The usefulness of this 
procedure will become apparent when we consider systems of confounding in 
Section 10.5. The SS given in (10.16a) is simply the SS (among sets) for the 
sets defined by a f x = 0, 1,2. Since the various partitions are orthogonal to each 
other, so are their associated SSs. It is not difficult to show that the sum of the 
four SSs in Table 10.3 is the same as SS(A x Bx CJ^in Jable 10.2. 

Generally, it is also useful to list the quantities E^, for main effects and 

interactions as they can be used to estimate the yield of any treatment combination 
or comparisons among treatment combinations (see Section 10.3). 


10.5 CONFOUNDING IN A 3 n FACTORIAL 

The necessity for using incomplete blocks for a 3 n factorial is even more obvious 
than for a 2 〃 factorial. Even for small n the number of treatment combinations 
is often too large for complete blocks, either because such large blocks are not 
available or they are no longer homogeneous. The use of smaller blocks will 
then lead to confounding of effects and interactions with blocks, a notion that we 
have discussed extensively for the 2 n factorials. We shall see that the basic ideas 
encountered there can be modified easily for 3 〃 factorials. We shall illustrate this 
with a simple example before discussing useful systems of confounding more 
generally. 
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10.5.1 The 3 3 Experiment in Blocks of Size 3 

Suppose we wish to arrange a 3 3 experiment in blocks of size 3. This will result in 
nine blocks with 8 d.f. among them. Hence, we need to confound with blocks four 
pairs of degrees of freedom, each pair representing a main effect or an interaction 
component. Suppose we first choose to confound AB and AC 2 . Recall that AB 
is represented by comparisons among the sets satisfying the equations 

x\-\- X 2 = 0, 1, 2 mod 3 

and that AC 2 is similarly represented by the equations 

x\ + 2^3 =0, 1,2 mod 3 
It is obvious then that considering jointly 

x\-\- X 2 = k mod 3 

x\ + 2^3 = l mod 3 (10.17) 

for all possible combinations of k, l, = 0, 1, 2, we partition the 27 treatment 
combinations into 9 sets of 3 treatment combinations each, these sets being the 
blocks for the desired system of confounding. Now, any treatment combination 
which satisfies (10.17) for given (k ， i), also satisfies 

2x\ + ^2 + 2^3 = k +1 mod 3 

or, equivalently, 

x\ + 2 x 2 + ^3 = 2(k + i) mod 3 

and since 2(k + ^) = 0, 1, 2 mod 3 it follows that there are three sets of three 
blocks which satisfy the equation 

x\ + 2x2 + X 3 = 0, 1, 2 mod 3 

respectively. Comparisons among these sets, however, define the interaction com¬ 
ponent AB 2 C. Hence AB 2 C is also confounded with blocks, and we recognize 
immediately that AB 2 C is a GI of AB and AC 2 ; that is, 

(AB) x (AC 2 ) = A 2 BC 2 = A 4 B 2 C 4 = AB 2 C 

Similarly, any treatment combination that satisfies (10.17) also satisfies the 
equation 


(x\ + X 2 ) + 2{x\ + 2 又 3 ) = k-\-21 mod 3 
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Table 10.4 Plan for 3 3 Experiment in Blocks of Size 3 


1 

2 

3 

4 

5 

6 

7 

8 

9 

000 

002 

001 

010 

012 

Oil 

020 

022 

021 

121 

120 

122 

101 

100 

102 

111 

110 

112 

212 

211 

210 

222 

221 

220 

202 

201 

200 


or 

X 2 ~\~ = k-\-21 mod 3 

Hence the other GI 

(AB)(AC 1 2 ) 2 = A 3 BC 4 = BC 

is also confounded with blocks. These four interactions, AB, AC 2 , AB 2 C, and 
BC, then account for the 8 d.f. for comparisons among blocks. 

The composition of the blocks for the above system of confounding can be 
obtained from Eqs. (10.17) with (k, i) assuming all possible values. Alternatively, 
we can construct first the intrablock subgroup (IBSG) from 

x\ X 2 = 0 mod 3 

x\ + 2x3 = 0 mod 3 

and then, using the x representation for treatment combinations, add (componen¬ 
twise and mod 3) a treatment combination, not already contained in the IBSG, to 
each element in the IBSG. This process is continued as described in Section 8.3, 
until all blocks have been constructed in this manner as given in Table 10.4. 

10.5.2 Using SAS PROC FACTEX 

A similar design can be obtained by using SAS PROC FACTEX and is given in 
Table 10.5. 

We shall comment briefly here on some aspects of the SAS output and how 
it relates to our discussion in this chapter: 

1. We note that rather than using 0 ， 1, 2 for the factor levels, SAS uses —1, 
0, 1, respectively, as commonly used in response surface and regression 
methodology. 

2. The single degree of freedom associated with the main effects and interac¬ 

tions are listed formally akin to the linear-quadratic effects representation 
for quantitative factors (see 1.11.8.1)，for example, 


A —— ^ A linear 
2 * A —— ^ A quadratic 
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Table 10.5 3 3 Factorial in Blocks of Size 3 


options nodate pageno=l ； 
proc factex ； 
factors A B C/nlev=3 ； 
blocks size=3 ； 
model est=(A B C); 

examine design confounding aliasing ； 

output out=design blockname=block nvals=(1 23456789 )； 
titlel 'TABLE 10.5'; 

title2 '3**3 FACTORIAL IN BLOCKS OF SIZE 3'; 
run ； 

proc print data=design; 
run ； 

The FACTEX Procedure 


Design Points 

Experiment 

Number ABC Block 


1 -1 

2 -1 

3 -1 

4 -1 

5 -1 

6 -1 

7 -1 

8 -1 

9 -1 

10 0 

11 0 

12 0 

13 0 

14 0 

15 0 

16 0 

17 0 

18 0 

19 1 

20 1 

21 1 

22 1 

23 1 

24 1 

25 1 

26 1 

27 1 



0 

0 

0 

1 

1 

1 


0 

0 

0 

1 

1 

1 


0 

0 

0 

1 

1 

1 



1 

9 

5 

6 
2 

7 

8 
4 
3 

3 
8 

4 

5 
1 
9 
7 

6 
2 
2 

7 
6 

4 
3 

8 
9 

5 
1 


Block Pseudofactor Confounding Rules 

[Bl] = (2*A) + (2*B) + (2*C) 

[B2] = B+(2*C) 
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Table 10.5 (Continued) 


Aliasing Structure 





A 







(2*A) 







B 







(2*B) 







C 







(2*C) 







A 

+ 

B 





[B] = (2*A) 

+ 

B 





[B] = A 

+ 

(2*B) 





(2*A) 

+ 

(2*B) 





A 

+ 

C 





[B] = (2*A) 

+ 

C 





[B] = A 

+ 

(2*C) 





(2*A) 

+ 

(2*C) 





B 

+ 

C 





[B] = (2*B) 

+ 

C 





[B] = B 

+ 

(2*C) 





(2*B) 

+ 

(2*C) 







Obs 

block 

A 

B 

c 



1 

1 

-1 

-1 

-1 



2 

1 

0 

0 

0 



3 

1 

1 

1 

1 



4 

2 

-1 

0 

0 



5 

2 

0 

1 

1 



6 

2 

1 

-1 

-1 



7 

3 

-1 

1 

1 



8 

3 

0 

-1 

-1 



9 

3 

1 

0 

0 



10 

4 

-1 

1 

0 



11 

4 

0 

-1 

1 



12 

4 

1 

0 

-1 



13 

5 

-1 

-1 

1 



14 

5 

0 

0 

-1 



15 

5 

1 

1 

0 



16 

6 

-1 

0 

-1 



17 

6 

0 

1 

0 



18 

6 

1 

-1 

1 



19 

7 

-1 

0 

1 



20 

7 

0 

1 

-1 



21 

7 

1 

-1 

0 



22 

8 

-1 

1 

-1 



23 

8 

0 

-1 

0 



24 

8 

1 

0 

1 



25 

9 

-1 

-1 

0 



26 

9 

0 

0 

1 



27 

9 

1 

1 

-1 
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and 


A-\- B —— ^ A linear x B linear 


2 * A + B —— ^ A quadratic x B linear 
A -\-2^ B —— ^ A linear x B quadratic 
2 * A + 2 * ^ —— ^ A quadratic x B quadratic 

We should point out, however, that these representations are not identical 
[see also (4) below]. 

3. The confounding rules are essentially the same as those we have explained 
earlier. In this example the block compositions are obtained by satisfying 
the equations 


2*A + 2*5 + 2*C = 


B 2 ^ C = 82 

for some 8 \, 82 (= 0, 1,2 mod 3), where A ， B ，and C are the levels of those 
factors. In our notation this is equivalent to satisfying the equations 


or 


2xi + 2^2 + 2 x 3 = Vi 
X2 + 2^3 = Y2 

xi-\-x 2 -\-X3 = y* 
X2 + 2X3 = Y2 


Hence, in this example we confound ABC and BC 2 and, hence, AB 2 and 
AC 2 with blocks. We only need to remember that —1 = 2 mod 3 and 
—2 = 1 mod 3. 

4. The aliasing structure gives a list of the main effects and 2-factor interac¬ 
tions that are either estimable or confounded with blocks, the latter being 
identified by [5]. More precisely, we should really say that the aliasing 
structure represents a list of the number of degrees of freedom associ¬ 
ated with estimable and confounded effects, respectively. For example, 
the output identifies A-\- B and 2*A + 2*^ as estimable. This does not 
mean, however, that A linear xB linear, or A quadratic xB quadratic are 
estimable since there is no relationship between these components and the 
2-d.f. component AB. 
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10.5.3 General Case 

Following the example from Section 10.5.1 it is easy to see that for the general 
case of a 3 n factorial in blocks of size 3 P we then have the following: 

1. 3 n ~ p — 1 d.f. are confounded with blocks, that is, (3 n ~ p — 1)/2 main 
effects and/or interactions are confounded with blocks. 

2. If two interactions E a and are confounded with blocks, then their GIs 

anc j _E a+2 办 are also confounded with blocks. 

3. To find a system of confounding, one needs to specify only q = n — p 
independent main effects and/or interactions E Ul ， E ai ,..., E a q since the 
q interactions have 


2 




-3 q 


GIs among them. Hence altogether (3 q — 1)/2 interactions are confounded 
with blocks. 

4. The composition of the blocks is obtained by means of the IBSG, which 
is composed of the treatment combinations satisfying the equations 

OLj\X\ + aj2X2 H - h Oij n X n = 0 


(j = 1,2,q = n — p) as determined by the independent confounded 
interactions E ai , E a2 ,..., E a< i in (3). The remaining blocks are then 
obtained as described in Section 10.5.1. 


10.6 USEFUL SYSTEMS OF CONFOUNDING 

As we have mentioned earlier, the number of treatment combinations is quite 
large even for a moderate number of factors. This would call in most cases for 
incomplete blocks and hence for a system of confounding. But even this may 
lead to certain difficulties since at this time we are only considering blocks the 
size of which is a power of 3, so that the choice is quite limited. (For other 
block sizes we refer to Section 11.14.4.) To complicate matters, according to 
Fisher’s (1942, 1945) theorem (see Section 11.7) confounding of main effects 
and/or 2-factor interactions can be avoided only if the block size is larger than 
twice the number of factors, that is, k > 2n. For purposes of reference we list 
in Table 10.6 possible types of confounding involving up to five factors and 
various block sizes. Further systems can be obtained from this list by permuting 
the letters. 
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Table 10.6 Types of Systems of Confounding for 3 n Experiments 


Number of 

Factors 

Size of 
Blocks 

Confounded Effects/Interactions^ 

2 

3 

Any main effect or interaction 

3 

3 

A*, B*, AB, AB 2 

A*,BC*,ABC,AB 2 C 2 

a*,bc 2 *,ab 2 c,abc 2 

ab*,ac*,bc 2 ,ab 2 c 2 

ab*,ac 2 *,bc,ab 2 c 

ab 2 \ac*,bc,abc 2 

ab 2 \ac 2 \bc 2 ,abc 


9 

Any main effect or interaction 

4 

3 

A*, BC*, ABC, AB 2 C 2 , BD*, ABD, AB 2 D 2 , 
bc 2 d 2 , CD 2 , ab 2 cd, acd 2 , ac 2 d, abc 2 d 2 

5*, AC*, ABC, AB 2 C, AD t , ABD 2 , AB 2 D 2 , 

AC 2 D, CD, AB 2 C 2 D, BCD, ABC 2 D, BC 2 D 2 

C*, AB*, ABC, ABC 2 , AD*, ACD, AC 2 D, 

AB 2 D 2 , BD 2 , AB 2 C 2 D 2 , BCD 2 , AB 2 CD 2 , BC 2 D 2 
D*, AB 2 *, AB 2 D, AB 2 D 2 , AC t , AC 2 D, AC 2 D 2 , 

ABC, BC 2 , ABCD 2 , BC 2 D 2 , ABCD, BC 2 D 


9 

A*, B*, AB, AB 2 

a*,bc*,abc,ab 2 c 2 

A*, BCD*, ABCD, AB 2 C 2 D 2 

ab*,ac*,bc 2 ,ab 2 c 2 

AB*, CD*, ABCD, ABC 2 D 2 

AB*, ACD*, BC 2 D\ AB 2 C 2 D 2 

ABD*, ACD r , AB 2 C 2 , BC 2 D 2 


27 

Any main effect or interaction 

5 

9 

BE' A5C*, AB 2 CE, ACE 2 , CDE* 

BCDE 2 , BC 2 D 2 , ABC 2 DE, ABD 2 E 2 

ab 2 c 2 de 2 , ab 2 d 2 , ac 2 d, ad 2 e 


27 

ABC*, AB 2 DE*, AC 2 D 2 E 2 , BC 2 DE 


Effects with an asterisk (*) are the independent effects. 


Generally not all components of a particular interaction are confounded with 
blocks and, hence, limited intrablock information on that interaction is still avail¬ 
able (see also Sections 10.7.2 and 10.7.3). Even so, in most practical cases it will 
be useful to resort to partial confounding. These can be obtained easily from the 
systems provided in Table 10.6. In the following we shall comment briefly on 
some such systems. 
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10.6.1 Two Factors 

Since we can confound any pair of effect or interaction degrees of freedom, the 
following systems suggest themselves: (1) Use a basic pattern of two replicates, 
confounding AB and AB 2 , giving full information on main effects and j infor¬ 
mation on the interaction. (2) Use a basic pattern of four replicates, confounding 
A, B, AB, and AB 2 , each in one of the four replicates, giving equal information 
on main effects and interactions. The arrangement of the treatment combinations 
in blocks and the structure of the analysis of variance with q repetitions of the 
basic pattern are given in Table 10.7. 


10.6.2 Three Factors 

1. Blocks of Size 3 Consulting Table 10.6 suggests that a suitable system of 
confounding consists of a basic pattern of four replicates of the following 
types: 


Type I confound 
Type II confound 
Type III confound 
Type IV confound 


ab,ac,bc 2 ,ab 2 c 2 

AB,AC 2 , BC,AB 2 C 

ab 2 ,ac,bc,abc 2 

ab 2 ,ac 2 ,bc 2 ,abc 


This will yield full information on main effects, 士 relative information on 
2-factor interactions, and | relative information on 3-factor interactions. 

2. Blocks of Size 9 The most useful design consists of one or more repetitions 
of a basic pattern of four replicates confounding ABC, ABC 2 , AB 2 C, 
and AB 2 C 2 , respectively. This will result in full information on all main 
effects and 2-factor interactions and | relative information on the 3-factor 
interaction components. The arrangement of the blocks is given in Table 
10.8, each column (level of C) combined with the levels of A and B giving 
a block and each set of three columns a replicate. 


10.6.3 Treatment Comparisons 

At this point we comment briefly on evaluating the amount of information on 
any treatment comparison provided by the confounded design relative to the 
unconfounded design. To do so we make use of the fact that the yield of a 
treatment combination can be represented in terms of main effect and interac¬ 
tion components (see Section 10.3). For purposes of illustration suppose that we 
are interested in the comparison (aob^co — a^boci), using q repetitions of the 
basic pattern given in Table 10.8. Notice that aoboco and aob^ci never occur 
together in the same block, so that a simple comparison among their mean yields 
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Table 10.7 3 2 Experiment in Blocks of Size 3 


Replicate 

Effect/Interaction 


Type 

Confounded 

Blocks 


a. Arrangement 

in Blocks 

I 

A 

00, 01, 02; 10, 11, 12; 20, 21, 22 

II 

B 

00, 10, 20; 01, 11,21; 02,12,22 

III 

AB 

00, 12, 21; 01, 10, 22; 02, 20, 11 

IV 

AB 2 

00, 11, 22; 10, 02, 21; 01, 12, 20 


Source 

d.f. 


b. Analysis of Variance 


Replicates 

Aq — \ 


Blocks/reps 

Sq 


A 

2 


B 

2 


AB 

2 


AB 2 

2 


Residual 

24^-8 


A x reps 

2(3q - 1) 


II, III, IV 



B x reps 

2(3q - 1) 


I, III, IV 



AB x reps 

2(3q - 1) 


I ， II， IV 



AB 2 x reps 

2(3^ - 1) 


I ， II ， III 



Table 10.8 Design for 3 3 Experiment in Blocks of Size 9 


Level of 

A B 




Confounded Interaction 




ABC 

ABC 2 

ab 2 c 

ab 2 c 2 






Level of C 





_ 

m 

m 

1 

2 

u 

2 

1 

n 

1 

2 

m 

2 

1 

1 

■mm 

2 

□ 

1 

1 

□ 

2 

2 

□ 

1 

i 

El 

2 

2 

■■ 

1 

2 

m 

2 

1 


1 

2 

19 

2 

1 


■ 

i 

2 

□ 

l 

1 

□ 

2 

1 

2 


2 

1 

■ 

1 

l 

1 

2 

國 

2 

1 

EH 

HI 

1 

2 

■ 

2 

1 

2 

l 

■ 

1 

2 

■ 

2 

1 

2 

□ 

1 

1 

□ 

2 

■ 

2 

1 

2 

國 

2 

1 


2 

9 

1 

1 


2 

1 

2 

■ 

1 

2 

■ 

2 

1 

1 

2 

■ 

2 

1 

■ 

2 

2 

2 

Q 

1 

1 

B 

2 

■ 

1 

2 

■ 

2 

1 
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will not provide any useful information. Instead, we make use of the fact that 
[see (10.15)] 


cioboCQ = M -\- Ao + Bq + ABq + AB^ + Co 

+ AC 0 + ACl + BC 0 + BCl + ABC 0 
+ ABCI + AB 2 Cq + AB 2 Cl 
and 

aob()C\ = M + Aq + Bq + ABq + + C\ 

+ ACi + ACj + BCi + BC\ + ABCi 
+ ABCI + AB 2 Ci + AB 2 Cl 

so that the estimator for (aoboco — aoboci) is 

cioboco — aoboci = (Co — C\) + (ACo — AC\) 

+ (充 0 - Acl) + (兒 0 - BCi) 

+ (bcI - Bcfj + - ABC\) 

+ {ABCi - ABC 7 ^ + (as2C 0 - AB^Ci) 

^-2 --^- 2 \ 

+ ( A5 2 C 0 - ab 2 c 2 ) 

Each quantity in parentheses is statistically independent of the others (because 
of the orthogonality of the partitions), with a variance depending on the sys¬ 
tem of confounding used. In the present case any difference among main effect 
components (like Co — C\) and among 2-factor interaction components (like 
ACo — AC\) is estimated with variance 

2 2 

o a e = _^£_ 

4q . 9 I8q 

and any difference among 3-factor interaction components (like ABCo — ABC\) 
is estimated with variance 
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Hence 

- --- ^-\ 2cr 》 31 2 

Var ( ao ^ oCo - aoboCl ) =5 -18^ + 4 -27^ = ^ 

With no confounding and the same error variance, the variance of the comparison 

would have been a^/2q, so that the relative information on this comparison is 
54 _ 27 ( 

62 — 3l* 

10.6.4 Four Factors 

1. Blocks of Size 3 A suitable system consists of a basic pattern of four 
replicates, using the four systems of confounding given in Table 10.6. 
This design will result in | information on main effects and all 2-factor 
interaction components. 

2. Blocks of Size 9 The most useful type of confounding is obviously the 
last one given in Table 10.6 since it confounds only 3-factor interaction 
components, one from each type of 3-factor interaction. Altogether there 
exist eight such systems of confounding. 

3. Blocks of Size 27 In general, the experimenter will wish to avoid blocks of 
size as large as 27, though, in some fields of experimentation and with some 
types of experimental material, the effect on error variance of reducing 
block size from 27 to 9 may be so small as not to offset any loss in 
relative information that results from confounding. If blocks of size 27 
are being used, any of the eight 4-factor interaction components may be 
confounded. 

10.6.5 Five Factors 

1. Blocks of Size 9 It is not possible to avoid confounding a main effect or 2- 
factor interactions. Under these circumstances, the system of confounding 
given in Table 10.6 and permutation of that set would be most useful. 

2. Blocks of Size 27 One can show that it is not possible to find a design 
confounding only 4 - and 5-factor interactions. The most useful system of 
confounding is then one of the form given in Table 10.6. 

10.6.6 Double Confounding 

Occasionally it may be desirable to impose a double restriction on the pattern of 
a 3 n experiment. This leads to systems of double confounding. Suitable systems 
can be found by consulting Table 10.6. The actual arrangement of the treatment 
combinations can be obtained by first constructing the IBSGs for confounding 
with “rows” and “columns,” respectively, and then adding the elements of the first 
row and first column termwise mod 3. As an example consider the 3 3 experiment 
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with rows of size 9 ， confounding ABC 2 , and columns of size 3 , confounding 
AB ， AC, BC 2 , AB 2 C 2 ; that is, construct row blocks from 

x\ X2~\~ 2x3 = 0 mod 3 

and column blocks from 


xi + X2 = 0 mod 3 
xi + X3 = 0 mod 3 

The final arrangement (apart from randomization of rows and columns) then is 
as follows: 


Column 



1 

2 

3 

4 

5 

6 

7 

8 

9 

1 

000 

101 

202 

011 

112 

210 

022 

120 

221 

Row 2 

122 

220 

021 

100 

201 

002 

111 

212 

010 

3 

211 

012 

110 

222 

020 

121 

200 

001 

102 


Obviously, one will need replications of this or a similar arrangement, using 
different systems of confounding. 


10.7 ANALYSIS OF CONFOUNDED 3 n FACTORIALS 

In the course of our discussion of 3 n factorial experiments we have already 
commented on some aspects of the analysis for particular situations. We shall 
now make some remarks about the general 3 n factorial experiment in blocks 
of size 3 p using some system of complete or partial confounding. Since the 
development parallels that of Section 9 . 7 , we shall not repeat all the details, but 
only those that are specific to the 3 n case. 

The basic model underlying the analysis is as before: 

j + X ⑷ + X T r + ^ 


where p represents the replicate effects, P* the block within replicate effects, 
and r the treatment effects, or in its reparameterized form 


y = + X pp + + X T *r* + e 

where r* represents the interaction components for all admissible a != 
(ai, , ot n ) and / = 0 , 1 , 2 . A basic pattern of partial confounding consists 

of 5 types of replicates, each replicate consisting of 3 n ~ p blocks, and the basic 
pattern is repeated q times. 
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We divide the (3 n — 1)/2 interactions E a into three mutually exclusive sets 
based on the amount of confounding: 


£1 = 

= {E ak ,k 

= 1 , 2 ,.. 

.,n \： completely confounded} 

£2 = 

= 

= 1 , 2 ,.. 

.,n 2 : partially confounded} 

£3 = 

=\E &m ,k 

= 1 , 2 ,. 

..,« 3 : not confounded j 


and e £2 is confounded in c(yi) replicates and not confounded in u(yi) 
replicates of the basic pattern. We denote by N = 3 n sq the total number of 
observations. 

The following statements concerning the analysis are then obvious extensions 
of those made in Section 9.7. 


10.7.1 Intrablock Information 

1. Each E 8 h m g £ 3(/2 = 0, 1, 2) is estimated from all replicates, which implies 


var(gM) 


sqy 


j—1 


and 


SS(E 8m ) =sq3 


n—l 


E d m + E d m + E 6rn 


2. Each e £2 is estimated from qu(yt) replicates and hence 

var ( 砍 - 印） 


qu(y i )3 n ~ 


and 


(10.18) 


(10.19) 


SS 


(E y ^) = qu(n)3 n - 1 {[ 々 ] 2 + [ 定 H 2 + [E^f] (10.20) 


10.7.2 The ANOVAs 

The partitioning of the total sum of squares and its degree of freedom in the 
T|B-ANOVA is given in Table 10.9a with a further partitioning of the block 
and residual sums of squares in Tables 10.9b and 10.9c. All sums of squares 
associated with interactions are obtained in the usual way, following (10.18) 
and (10.20), except that SS(-) C is obtained from only those replicates in which 
the corresponding interaction is confounded. SS(E*.) is obtained only from the 
jth replicate in the ith repetition, and Yl'ij an d Hj denote summation over 
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Table 10.9 T|B-ANOVA for Partial Confounding 


Source 

d.f. 

SS 



a. Basic Partitioning 


x P \o 

sq — \ 

Usual 


X^*\0,x p 

sq(3 n ~P - 1 ) 

Usual 


X r *\3,X p ,X^ 

2n2 + 2«3 

SS(E y ^) + J2 SS(E s '") 

I\3,X p ,X^,X t * 


1 rn 

Difference 


Total 

N - 1 

Usual 



b. Partitioning of Block Sum of Squares 


x^*\o,x p 

sq(3 n ~P - 1 ) 



{E ak } 

2n\ 

X! ss (五 
lc 


{則 

2n2 

^SS(£ y Oc 

t 


[E ak x reps} 

2n\{sq — 1 ) 

EE ss ( £ 》. 

k ij 

-J2 SS (E a 〜 

k 

[E ae x reps} 

(n) - 2n 2 

t 

EE' SS ( 碎〜 

t U 

SS(£ y Oc 

t 


c. Partitioning of Residual Sum of Squares 


I\0,X p ,X^,X T * 




[E y ^ x reps} 

2q ^ u(yi) - 2^2 

i 

^ U 

- J2 SS{E y t) 

i 

[E 8fn x reps} 

In 认 sq — 1 ) 

EE SS ( 畸 )- 

m ij 

- J2 SS(£S 

m 


all replicates in which E n is confounded and not confounded, respectively (see 
Section 9.7). 

The B|T-ANOVA in its basic form is given in Table 10.10a with a partitioning 
of the block sum of squares in Table 10.10b. The only sum of squares that needs 
to be explained is that associated with vs. E y ^]. More specifically this sum 
of squares is associated with the comparisons 

(E^ - E^) vs. (E^ - E^) vs. (E Y 2 g - E Y 2 l ) (10.21) 

where E y h l is obtained from the qu(yi) replicates in which E yt g £2 is not 
confounded and E y h l is obtained from the qc(yi) replicates in which E y ^ is 
confounded. Since E^ e + + = 0 and + E^ 1 = 0, it follows 
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Table 10.10 B|T-ANOVA for Partial Confounding 


Source 

d.f. 

SS 


a. Basic Partitioning 


x P \o 

sq — l 

Usual 

X T *\0,X p 

3 n - 1 

Usual 

X^\3,X P ,X Z * 

sq(3 n ~P -l)-2ni 

Difference 


VR 

From Table 10.9 

Total 

N - 1 

Usual 

b. Partitioning of Block Sum of Squares 

X^\J,X pi X T * 

sq(3 n ~P - l)-2ni 


vs. 

2ri2 

See (10.24) 

{E ak x reps} 

2n\(sq — 1 ) 

From Table 10.9b 

[E y ^ x reps} 

^2 c (n) - 2n 2 

t 

From Table 10.9b 


that the comparisons (10.21) and hence the associated sum of squares carry 2 
d.f. This sum of squares is obtained in the usual way. Define 


- E y h e = Xl l (h = 0,1,2) 


Then 


SS (E y ( 


VS. 




X? 


( 10 . 22 ) 


where V Yi is the variance - covariance matrix of 



A 

47 


It is easy to see that 


n qu(ye)c(y e )3 n 




and hence 


〜 1 _1 — ^«(K£)c(n)3 ,!_1 / 2 

I 」 = ^ l- 1 
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Then (10.22) reduces to 

SS ( 宕 ye vs. E Yl ) = qu{y ^ c{ - yi)?,n Y^[ x Vf (10-23) 

s h 

Summing (10.23) over all E yi e £2 yields 

ss({£^ vs. Wf (10-24) 

s t h 

which is a sum of squares with 2 n 2 d.f. and which depends only on block effects 
and error. 


10.7.3 Tests of Hypotheses 

We can test the hypotheses 

H 0 : E y Q l = E\ l = E\ l = 0 (f e S 2 ) 

and 

H 0 : E S 0 m = E\ m = E^ m =0 (E Sm e £ 3 ) 


by using, in the T|B-ANOVA, the F tests as approximations to the randomization 
tests (see, e.g., 1.6.6 and 1.9.2): 


and 


MS (E&) 

MS (l\0,X p ,Xfi*,X T *) 

(10.25) 

MS (E Sm ) 

MS (l\3,X p ,Xfi*,X r *) 

(10.26) 


respectively, each with 2 and vr d.f. These are tests for no main effects or 
interactions. With regard to interactions, however, we must keep in mind that the 
total interaction among s factors has 2 s d.f. and hence has 2 s — l components 
of the form E a . For example, for 夕 = 3 and factors A, B, C, the components 
are ABC, ABC 2 , AB 2 C, AB 2 C 2 . To test then the hypothesis that there is no 
A x B x C interaction we use the F test: 

厂 [SS(A5C) +SS (ABC 2 ) + SS (AB 2 C) + SS (A5 2 C 2 )]/8 

F= Ms(/|a,z p ,z r ,z t *) 

with 8 and vr d.f. In this case an F test of the form (10.25) or (10.26) may not 
tell us very much about the 3-factor interaction, except that when one or the other 
of those tests is significant then we can conclude that A x B x C interaction is 
possibly present. 



ANALYSIS OF CONFOUNDED 3” FACTORIALS 


385 


For the general case of an 5 1 -factor interaction the following situations can 
occur for a particular system of confounding: 

1. All 2 s ~ l components belong to £2 and/or £ 3 . In this case we can pool the 
associated sums of squares in Table 10.9a and obtain an F test with 2 s and 
vr d.f. 

2. Some components belong to Si. In this case we can only use the sums of 
squares associated with the components in £2 and/or £ 3 . Great care has to 
be exercised in interpreting such a test. 

3. All components belong to £ 1 . In this case no test exists in the context of 
the T|B-ANOVA (but see Section 10.7.4). 


10.7.4 Interblock Information 

Assuming P* to be a vector of i.i.d. random variables with E{^) = 0 and 
E(m = cr 、 I ， we can obtain interblock information regarding E ak g £1 and 
E y ^ e £2 by considering E^ k and E【 £ (h = 0, 1,2) obtained from those replicates 
in which these interactions are confounded. We then have 

E{E a h k -E a h n = E a h k -K' 

with 

var (# - 砹 ） =+ 3^2) (10.27) 

and 

with 

var ( ⑦ - 砹） = qc{Y l yi _ x (〜 2 + 3 、 2 ) (10-28) 

If we define the pooled sum of squares associated with {E ak x reps} 
and {E y ^ x reps} in Table 10.9b as SS(Remainder) with v = 2n\(sq — 1) + 
2 q c (>^) — 2 行 2 d.f., then the hypotheses 

H 0 ： E^ k = E\ k = Ef = Q (E ak e £ 1 ) 

can be tested by using 

MS (E^) c 

F = 


with 2 and v d.f. 


MS (Remainder) 


(10.29) 
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10.7.5 Combined Intra- and Interblock Information 

Combined information is available and may be of interest for quantities of the 
form — for E ye e E 2 , or more generally for coE^ 1 + c\E^ £ + C 2 E? 
with co + ci + C 2 = 0. Using (10.19) and (10.28) we obtain 


_ ^Yi _ u(yt) w(E^ - E^) + c {yd w f (e [ 1 - E^) 
h h, u{yt) w + c{yt) w f 


(10.30) 


with w = l/cr^, w r = l/(cr^ + 3 P a 彦 ). Letting w/w , = p, we obtain 






(10.31) 


10.7.6 Estimation of Weights 

For the practical use of (10.30) and (10.31) in connection with, for example, 
the combined estimate of a{x) — a(z) for two treatment combinations x f and z’, 
using (10.15), we usually need to estimate w and w f (or p). As always 

i = MS(/|3,X p ,Z r ,X T *) (10.32) 

which is obtained from the T|B-ANOVA of Table 10.10. 

For the estimation of w r by the Yates procedure we use the B|T- 
ANOVA of Table 10.9. The two components of SS(Z^*|3, Z T:(: ) are 

SS(Remainder) as defined in Section 10.7.4 above and SS (£々 vs. E y ^ 
given by (10.24). Now 

E SS(Remainder) = v(crj + 3 P crj) (10.33) 

and, after straightforward derivation, 

E SS (E y t vs. = 2 a ； + 3 P aj (10.34) 

Hence, from (10.33), (10.34) and Table 10.10, 

E Z T ») = a e 2 + A 3 P aj (10.35) 

with 

u (yd + v 

^ 1 - - 

sq(3 n ~P — l) — 2n\ 


(10.36) 
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and v as defined in Section 10.7.4. It then follows that 

af+?,P aj = ^ MS(I\J,X p ,Xfi*,X r * 

+ |mS(XHUp ， A*) (10.37) 

with A given by (10.36). 

There are, of course, other methods of estimating the weights as described 
in Section 1.11. In any case, Satterthwaite’s procedure (Satterthwaite, 1946; see 
also 1.9.7.7) must be used to obtain the degrees of freedom associated with the 
estimator given in (10.37). 


10.8 NUMERICAL EXAMPLE 

We consider the 3 2 factorial in blocks of size 3, confounding the 2-factor inter¬ 
action component AB with blocks. The data as well as the intrablock analysis 
(using SAS PROC GLM) and combined intra- and interblock analysis (using 
SAS PROC MIXED) are given in Table 10.11. We shall comment briefly on 
these analyses. 

10.8.1 Intrablock Analysis 

We find a] = MS(/|3, X p , X p * ， X T *) = MS(E) = .8333. MS(E) (with 6 d.f.) 
is used to test hypotheses about A, B, and A x B by forming F ratios with 
the respective type III mean squares in the numerator. Concerning the A x 5 
interaction, we know, of course, that it has only 2 d.f., which are associated with 
the interaction component AB 2 . 

10.8.2 Combined Analysis 

In this example AB is confounded in both replicates, that is, AB belongs to £i, 
whereas A, B, and AB 2 belong to £ 3 . This means that only interblock infor¬ 
mation is available for AB and only intrablock information is available for A, 
B ，and AB 2 . Thus, the combined analysis for A and B yields the same results 
as the intrablock analysis. We can verify this easily by comparing the F ratios 
and P values using the type III MS in PROC GLM and PROC MIXED, respec¬ 
tively. In both cases we find for A: F = 126.87, P = .0001, and for B: F = 2.07, 
P = .2076. 

To test the hypothesis that there is no A x 5 interaction we obtain 

SS(A x B) = SS(AB) + SS(AB 2 ) (10.38) 
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Table 10.11 Analysis of 3 2 Factorial in Blocks of Size 3 
(With AB Completely Confounded) 


data three ； 

input rep block A B y @@； 
datalines ； 

1100 11 1112 14 1121 20 

1202 20 1211 23 1220 27 

1301 20 1310 27 1322 30 

2400 20 2412 25 2421 29 

2502 27 2511 30 2520 32 

2601 31 2610 41 2622 43 


run ； 

options nodate pageno=l ； 
proc print data=three ； 
titlel 'TABLE 10.11 '； 

title2 '3**2 FACTORIAL IN BLOCKS OF SIZE 3'; 
title3 1 (WITH AB COMPLETELY CONFOUNDED)'; 
run ； 

proc glm data=three ； 

class rep block A B ； 

model y=rep A|B block(rep )； 

title3 'INTRA-BLOCK ANALYSIS'; 

run ； 

proc mixed data=three ； 
class rep block A B ； 
model y=rep A|B/ddfm=satterth ； 
random block(rep )； 

title3 'COMBINED INTRA- AND INTER-BLOCK ANALYSIS'; 
run ； 


Obs 

rep 

block 

A 

B 

y 

1 

1 

1 

0 

0 

li 

2 

1 

1 

1 

2 

14 

3 

1 

1 

2 

1 

20 

4 

1 

2 

0 

2 

20 

5 

1 

2 

1 

1 

23 

6 

1 

2 

2 

0 

27 

7 

1 

3 

0 

1 

20 

8 

1 

3 

1 

0 

27 

9 

1 

3 

2 

2 

30 

10 

2 

4 

0 

0 

20 

11 

2 

4 

1 

2 

25 

12 

2 

4 

2 

1 

29 

13 

2 

5 

0 

2 

27 

14 

2 

5 

1 

1 

30 

15 

2 

5 

2 

0 

32 

16 

2 

6 

0 

1 

31 

17 

2 

6 

1 

0 

41 

18 

2 

6 

2 

2 

43 
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Table 10.11 (Continued) 


INTRABLOCK ANALYSIS 


The GLM Procedure 


Class Level Information 


Class 


Levels 


Values 


rep 

block 

A 

B 


1 2 3 4 5 6 
0 12 
0 12 


Number of observations 18 


Dependent Variable : y 




Sum of 

Mean 



Source 

DF 

Squares 

Square 

F Value 

Pr > F 

Model 

11 

1136.777778 

103.343434 

124.01 

< . 0001 

Error 

6 

5.000000 

0.833333 



Corrected 

Total 17 

1141.777778 





R-Square 

Coeff Var 

Root MSE 

y Mean 



0.995621 

3.496101 

0.912871 

26.11111 



Source 

DF 

Type I SS 

Mean Square 

F Value 

Pr > F 

rep 

1 

410.8888889 

410.8888889 

493.07 

< . 0001 

A 

2 

228.1111111 

114.0555556 

136.87 

< . 0001 

B 

2 

3.4444444 

1.7222222 

2.07 

0.2076 

A*B 

4 

464.2222222 

116.0555556 

139.27 

< . 0001 

block(rep) 

2 

30.1111111 

15.0555556 

18.07 

0.0029 


Source 


DF Type III SS Mean Square F Value 


Pr > F 


rep 

A 

B 

A*B 

block(rep) 


410.8888889 
228.1111111 
3.4444444 
18.7777778 
30.1111111 


410.8888889 
114.0555556 
1.7222222 
9.3888889 
15.0555556 


493.07 
136.87 
2.07 
11.27 
18.07 


<•0001 
<•0001 
0.2076 
0.0093 
0.0029 


2 
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Table 10.11 {Continued) 

COMBINED INTRA- AND INTERBLOCK ANALYSIS 
The Mixed Procedure 


Model Information 


Data Set 

Dependent Variable 
Covariance Structure 
Estimation Method 
Residual Variance Method 
Fixed Effects SE Method 
Degrees of Freedom Method 


WORK.THREE 

y 

Variance Components 

REML 

Profile 

Model-Based 

Satterthwaite 


Class Level Information 


Class Levels Values 


rep 

block 

A 

B 


2 12 

6 1 2 3 4 5 6 

3 0 12 

3 0 12 


Dimensions 


Covariance Parameters 2 
Columns in X 18 
Columns in Z 6 
Subjects 1 
Max Obs Per Subject 18 
Observations Used 18 
Observations Not Used 0 
Total Observations 18 


Iteration History 


Iteration Evaluations 


-2 Res Log Like 


Criterion 


0 1 42.27802731 

1 1 34.77498334 0.00000000 


Convergence criteria met. 

Covariance Parameter 
Estimates 


Cov Parm 


Estimate 


block(rep) 
Residual 


4.7407 
0.8333 
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Table 10.11 (Continued) 

Fit Statistics 

-2 Res Log Likelihood 34.8 

AIC (smaller is better) 38.8 

AICC (smaller is better) 41.2 

BIC (smaller is better) 38.4 



Type 3 

Tests of 

Fixed Effects 



Num 

Den 



Effect 

DF 

DF 

F Value 

Pr > F 

rep 

1 

2 

27.29 

0.0347 

A 

2 

6 

136.87 

<.0001 

B 

2 

6 

2.07 

0.2076 

A*B 

4 

2 

13.03 

0.0725 


from the type I SS in PROC GLM as 

SS(A x B) = 464.22 

with 4 d.f. Now we know from (10.16b) and its extension to completely con¬ 
founded effects that 

E {SS(A5)} = 20-2 + 2-3 {[AS 0 ] 2 + [AB x f + [AB 2 ] 2 \ (10.39) 

and 

E {SS045 2 )} = 2(a e 2 + 3 ct 2 } + 2 . 3 |[^ 5 2]2 + [A5f ] 2 + [Afi 2 2 ] 2 j (10.40) 

Hence, using (10.38), (10.39), and (10.40), 

E{MS{A^B)} = a^+ 3 -aj+ 3 - [AS,] 2 + ^ [AS, 2 ] 2 j (10.41) 

Under Hq (10.41) reduces to + We therefore need to obtain an estimator 
for this quantity that will then provide the denominator for the F ratio to test 
Hq. Using (10.37) and (10.32) it is easy to see that 

苟 + ㈣ = 0 - i) Ms(m ^ x T *) 

+ 1 - 
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where A is obtained from (10.36), recognizing that u(yi) = 0 since 已 2 = 伞 ， v = 
2, s = 2, q = l, n = 2, p = n\ = 1. Thus, A = 1 and 

o] + \aj = lMS(/|3, Z T *) +iMS(X r |3,^, W 

= \ (.8333 + 15.0556) = 7.95 

using results from the ANOVA table. Finally, 

MS(A x B) 116.06 

「 = - = 14.60 

7.95 

This value is comparable to the corresponding value 13.03 obtained with PROC 
MIXED, which uses REML to estimate the variance components cr 》and a,. 

The degrees of freedom for the denominator of the F ratio are obtained by 
Satterthwaite’s procedure as 

_ [(.8333 + 15.0556)/2] 2 _ 

df _ /.8333\ 2 / /15.0556\ 2 I 

( 丁 ) M 丁 ) / 2 

which is comparable to Den DF = 2 in the PROC MIXED output. 


苟 + i 芍 



CHAPTER 11 


General Symmetrical Factorial Design 


11.1 INTRODUCTION 


In the preceding chapters we have discussed at great length the design and analy¬ 
sis of factorial experiments with two and three qualitative levels. In both cases the 
development is based upon orthogonal partitions of the complete set of treatment 
combinations and comparisons among the resulting subsets. These partitions are 
based on solving certain equations or sets of equations using only elements from 
the (mathematical) field of residue classes mod 2 and mod 3, respectively and 
elementary facts about ordinary arithmetic mod 2 and mod 3, respectively. The 
question then is whether this can be extended. 

Consider, say, arithmetic mod 4. Addition with an additive identity and mul¬ 
tiplication with a multiplicative identity are easily defined, for example, 


2 + 3= 1 


and 


2x3 = 2 

However, this is not enough. We want to set up families of hyperplanes defined 
by, for example, 


x\ + 2x2 = 0, 1, 2, 3 

and in order to achieve orthogonal partitions of the 4" treatment combinations, 
we need the result that the equation in the unknown x 


ax = b (a ^ 0) 


Design and Analysis of Experiments. Volume 2: Advanced Experimental Design 

By Klaus Hinkelmann and Oscar Kempthorne 
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has a unique solution. We see immediately that arithmetic mod 4 does not have 
this property; for example, 


2x = 2 


is satisfied by x = 1 and x = 3 and 


2 x = l 

is not satisfied by any x. So we see that the simple arithmetic mod k does not 
extend immediately to any 众 > 3. It does so, however, if A: is a prime number, 
p say. 

The addition and multiplication properties mod p of the set 5 = {0, 1, 2,, 
p — 1} are obvious. It only remains to show that 

ax = b (a ^ 0) 


has a unique solution mod p. Suppose we have two solutions x\, X 2 with x\ ♦ x:. 
Then 


a(x\ — X 2 ) = 0 

Since x\ — X 2 = d, say, e S, we have 

ad = 0 


but 


ad = 0 < —— > ad = cp 

with c e S. Since neither a nor d are divisible by p, this relationship cannot hold 
unless d = 0, and hence x\ = X 2 . So, division is unique, and hence S forms a 
field. We denote this field of p elements and arithmetic mod p by GF(p). Further 
generalizations can be made to k = p m , a prime power, using the Galois field 
GF(p m ) (see Appendix A). 

In this chapter we shall discuss the p n factorial as a generalization of the 
2 n and 3 n factorials and indicate how this, in turn, can be generalized to the 
s n = (p m ) n factorial. Much of the development in this chapter is due to Bose 
(1947b), Bose and Kishen (1940), Fisher (1942, 1945), Kempthome (1947, 1952), 
Rao (1946a, 1947b), and Yates (1937b). 
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11.2 REPRESENTATION OF EFFECTS AND INTERACTIONS 

We now have n factors A\, A 2 ,..., A n each at p levels; hence we have p n treat¬ 
ment combinations and p n — l d.f. among treatments. A treatment combination is 
denoted by the n-tuple x f = (xi, JC 2 ,..., x n ) with x/ G GF(p). We then consider 
a partition a! = a 2 ,..., a n ) with of/ G GF(p), but not all oti = 0, which 

partitions the set of p n treatment combinations into p sets of p n ~ x treatment 
combinations each through the equations 


a\x\ + OL 2 X 2 H - h a n x n = 8 


( 11 . 1 ) 


where 8 takes on all values in GF(p). That this is true can be seen as follows: 
Suppose for fixed a and 8 we have at ^ 0 for some i. We can then choose all 
xj(j ^ i) freely (xj = x* say) and obtain, from (11.1), 

ociXi = 8 — ^2 a j x： j = 5*, say 
j^i 

Because of uniqueness of division in GF(p),Xi is then determined uniquely. 
Since each Xj(j ^ i) can take on p different values x*, we have p n ~ l solutions 
to (11.1). Comparisons among the p sets of treatment combinations generated 
by (11.1) define the p — I d.f. associated with the effect or interaction E a = 

Let S(x; a, 8) denote the set of treatment combinations x r = (x\,X 2 , ..., x n ) 
satisfying (11.1). Then a contrast belonging to E 01 can be defined formally as 
c sS(x m , a, 8) with 8 e GF(p) and q = 0. 

Excluding = (0, 0,… ， 0) there exist p n — 1 partitions a. However, for any 
given a the partitions a, 2a, l)a are identical since they lead to the 
same equations of the form (11.1). Hence we have (p n — l)/(p — 1) distinct 
partitions, each accounting for p — I d.f., and consequently, these (p n — 1)/ 
(p — 1) partitions account for the p n — \ d.f. among treatments. In order to have 
a unique enumeration, we restrict the first nonzero a/ in a partition cc to be equal 
to 1. The partitions then define the main effects, 2-factor interactions,..^-factor 
interaction, designated in general by E a = A^ 1 A^ 2 - - - A^ n with the convention 
that a letter A/ with at = 0 is dropped from the expression. 

As a consequence of partitioning the totality of p n — l d.f. into (p n — 1)/ 
(p — 1) sets of p — 1 d.f. each, a 众 -factor interaction, for example, A\ x 
A 2 x ••• x Ak, consists of (p — l) k ~ l components denoted by, for example, 
A 1 A ^ 2 - - - A^ k , where o^ ， 0 ^ 3 ,. •., take on all nonzero values in GF(p). As 
an example, in a 5 n design the 2-factor interaction A x B consists of AB, AB 2 , 
AB 3 , AB 4 . 

We also note that for two distinct partitions a! = (cq, ... ， a n ) and p r = 
0 ^ 1 , 卢 2 , … D the equations 


ot\x\ + U 2 X 2 H - h a n x n = 5i 


( 11 . 2 ) 
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and 


P\ x \ + ^ 2 X 2 + • • • + Pn x n = ^2 (11.3) 

are satisfied by p n _ 2 treatment combinations. That this is so follows from the 
theory of linear equations: Because the partitions a and P are distinct, we can 
find of ， i , otj , pi , such that 


Of/ 

Pi 



^0 


We can then choose all Xk(k ^ i, j) freely, say Xk = x^. Then (11.2) and (11.3) 
reduce to 

atXi + ajXj = 8\ — ^ otkx^ = 8* 

昧 j 

p i x i -^p j x j = 8 2 - J2 Pk4 = $ 


These equations have a unique solution in Xi and xj, and since each Xk can take 
on p different values x^, we have p n - 2 different solutions to (11.2) and (11.3). 
This is true for all 52 = 0, 1,... ， p — 1 and 8 \ fixed, which implies that the p n_1 
treatment combinations satisfying (11.2) can be divided into p distinct sets of 
p n ~ 2 treatment combinations each satisfying one of the p equations (11.3) with 
<^2 = 0, 1,..., p — 1. Any contrast belonging to E a is therefore orthogonal to 
any contrast belonging to E 技 . It is in this sense then that the partitions a and )3, 
and hence the interactions E a and , are orthogonal. 


11.3 GENERALIZED INTERACTIONS 

Any treatment combination x that satisfies the equations (11.2) and (11.3) also 
satisfies the equation 

(oil + Pl)x\ + («2 + + ... + (Qf w + =^ 1+^2 (11.4) 

As and 82 take on all values in GF(p), 8 \ + 82 takes on each value in GF(p) 
exactly p times. Combining all x that satisfy (11.4) for a particular value of 
+ 82 , we have partitioned the treatment combinations according to 

(Qfl + Pl)x\ + («2 + Pl) x 2 + ..• + (“《+ Pn) x n = 8 

with 8 e GF(p). This is, of course, the partition a + )9, and the corresponding 
interaction is, in conformity with previous usage, a GI of E 01 and . 
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We can make the same statements more generally about 

(Qfl + XP\)X\ + ((X 2 ~h ^Pl ) x 2 + ... + {oi n + X^ n )x n = + 入 52 

with X e GF(p), X ^ 0. This then leads to partitions a + 入 and hence to the 
totality of p — l GIs £ a +^p 0 f E a and . These GIs are defined by comparisons 
among the sets of treatment combinations satisfying 

(Q^i + ^^\)X\ + ((X 2 + 人卩 2) x 2 + * * * + (oi n + = 各 (11.5) 


with 8 e GF(p) and each X e GF(p), 入 # 0. 

Formally the GIs are obtained by multiplying the corresponding letters raised 
to certain powers into each other, reducing the powers mod p and modifying 
the powers (if necessary) such that the first letter included appears with power 
unity by multiplying every power by the same appropriate value in GF(p). For 
example, in a 5 3 factorial the GIs of AB and AC 2 are 

(AB) x (AC 2 ) = A 2 BC 2 = (A 2 BC 2 ) 3 = AB 3 C 

(AB) x (AC 2 ) 2 = A 3 BC = (A 3 BC) 2 = AB 2 C 2 

(AB) x (AC 2 ) 3 = A 4 BC = (A 4 BC ) 4 = AB 4 C 4 

(AB) x (AC 2 ) 4 = BC 3 

More generally we have the following theorem. 


Theorem 11.1 The total number of GIs among q interactions E ax , E a2 ,... ， 
E 01 ^ is given by 


0 。， P) = g O - 1)) _1 (11-6) 

Proof The q interactions are defined by the q sets of equations 

aj\x\ + aj 2 X 2 H - aj n x n = 8j 

with 8 j e GF(p), y = 1, 2,..., or for short 

a f jX = 8 j (j = 1,2,... ,q) (11.7) 

Any treatment combination x that satisfies (11.7) for a given set 8 \, 82 ,..., S q 
also satisfies any of the equations 


^ kjd'jX = ^ XjSj (11.8) 
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with 入 ） e GF(/?) except ( 入 i, 入 2 ,… ，入 g) = (0, 0,… ， 0). We also exclude any 
A/ =( 入 1 , 入 2 ,... ，入 with only one nonzero 入 y since such equations define the 
q interactions themselves. Given then a particular X, if we let the 8 j associated 
with the nonzero components of X take on all values in GF(p)，we obtain the 
partition A/a that defines the interaction E X a , which by definition is a GI of 
E ai , E 012 ,..., E a q • To have a unique enumeration of all possible X and hence 
all possible GIs, we restrict the first nonzero component in X to unity. All other 
components then take on all possible values in GF(p). Hence the possible number 
of X vectors is 



which is obtained by considering all admissible X with 2,3,... ,q nonzero com¬ 
ponents, respectively. □ 


11.4 SYSTEMS OF CONFOUNDING 

With a p n factorial and blocks of equal size the only block sizes are k = (l < 
n). lfi< n, we need to confound certain interactions with blocks. More precisely, 
for a p n factorial in blocks of size we have p n ~^ blocks and hence we must 
confound {p n ~^ — 1)/(/? — 1) interactions with blocks. To find such a system of 
confounding, we first state the following theorem. 

Theorem 11.2 If in a p n factorial with equal block sizes p^(< p n ~ 2 ) two 
interactions E a and are confounded with blocks, then so are their GIs £ a+ ^, 
入 e GF(p), 入 # 0. 

Proof Consider the equations associated with E a and E^: 

a\X\ + ci 2 X 2 H - h a n x n = 81 

Pl x l + + . . • + Pn x n = ^2 ( 11 . 9 ) 

For any pair ( 8 \, 82 ) the equations (11.9) are satisfied by a set of p n - 2 treatment 
combinations x r = (xi, X 2 ,..., x n ) denoted by S(x; a, 5i; 於， 82 ) say. Each set 
S(x^ a, 8 \； P ， 82 ) makes up one or several blocks. A contrast belonging to E a is 
given by 

cs i S(x-,oc,S l ^,8 2 ) ( 11 - 10 ) 

劣 1 h 
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and a contrast belonging to by 

so; a ， jM 2 ) (11.11) 

h ^1 


with 山 2 = 0. The contrasts (11.10) and (11.11 )， when evaluated 

in terms of treatment and block effects, are linear functions not only of treatment 
effects but also of block effects as each set S(x; a, 8 \； ) 9 , 82) contains a sum 
of block effects multiplied by a common constant, or q 2 . Now, for some 
入 e GF(p), 


S(x;a + X^,8) = ^ S(x;cc,8i-,p,8 2 ) (11.12) 

^ 1^2 

5 i+ 入占 2=5 


Hence a contrast belonging to can be expressed as 


as S(x; a + 入 )3, 8) = y^ as 
8 8 


^2 S(x;a,8i-, P ， 8 2 ) 

<5i,^2 

_5i+A52=5 


(11.13) 


with as = 0. Since (11.13) is of the same form as (11.10) and (11.11), that 
is, a linear combination of the same 5(x; a, 8 \； P, 82 ), it is also a function of 
block effects, and hence is also confounded with blocks. This holds for 

all 入 e GF(p), hence for all GIs of E a and . □ 

We now consider specifically a system of confounding of a p n factorial in 
blocks of size p l . 

Theorem 11.3 A system of confounding for a p n factorial in blocks 
of size is completely determined by q = n — i independent interactions 

/ 7«1 / 7«2 J7<^q 

i—t , i—j , . « « ， ±-j • 


Proof Since with any two interactions their GIs are also confounded with 
blocks, it follows immediately that with q interactions all their GIs are also 
confounded with blocks. By Theorem 11.1, the total number of GIs is 0(^, p) as 
given in (11.6). Hence together with the q independent interactions we confound 


q + (p(q, p) 



(P — l) h 
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interactions. Each interaction carries p — I d.f. The total number of degrees of 
freedom confounded with blocks is then 



This is the number of degrees of freedom among the p n ~ l blocks of size p l . □ 

Once the n — £ independent interactions have been chosen, it remains to obtain 
the composition of the blocks. One way to do this, obviously, is to consider the 
q = n — i equations associated with E 011 ， E ai ， … ， E a< i ： 

Oi\\X\ + Of 12^2 H - h Oi\ n X n = 8\ 

0(21X1 + Qf22^2 H - h a2nX n = h 

(11.14) 


^^ 1 义 1 + ^q2^2 + . . . + aq n X n = 8q 

for all possible p n ~^ right-hand sides. For fixed (8\, <52,, S q ) the equations 

(11.14) are satisfied by p l treatment combinations. Hence for all possible choices 
of (8\, ^ 2 ,, S q ) we obtain p q = p n - 1 distinct sets of treatment combinations 
that then form the blocks. 


11.5 INTRABLOCK SUBGROUP 

Another way we have already used with the 2 n and 3 〃 factorial is to first obtain the 
IBSG and then use it to generate the remaining blocks. Denoting the equations 

(11.14) with 8i = 0 (i = 1,2,q) by (11.14’)，we first prove the following 
theorem. 

Theorem 11.4 The treatment combinations x f = (xi, a ： 2 , ..., x n ) satisfying 
the equations (11.14’）form a group, the IBSG, of size p l with componentwise 
addition as the operation among elements. 

Proof (i) The solutions to (11.14’）contain the additive unit element 
(0, 0, …， 0); (ii) if {x\,X 2 ,.. ., x n ) satisfies (11.14 ’)， then (Xx\, Xx 2 , ..., ^x n ) 
with 入 e GF(p) also satisfies (11.14 ’)； (iii) if (x\,X 2 , . . ., x n ) and (y\, j 2 ,. .., y n ) 
satisfy (11.14’)，then (x\ + ji ,^2 + y 2 , … ，知 + y n ) satisfies (11.14’). □ 

Suppose now that ? is in the IBSG and that z! = (zi, Z 2 , … ， z n ) is a treat¬ 
ment combination not belonging to the IBSG. Then the set {x f + z!\ x' G IBSG} 
is a set of treatment combinations satisfying Eq. (11.14) for some vector 
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( 8 \, 82 ,, S q ) other than (0, 0,… ， 0). Hence these treatment combinations 
form a block. We shall refer to 7 ! as the generator of this block. This process can 
be continued by choosing successively altogether p n ~^ such generators. These 
generators can obviously be found simply by inspection after another block has 
been constructed in this way. In certain situations it is possible, however, to give a 
complete characterization of the generators z\, Z 2 , • • • z w (w = p n 一之一 1 ) ， say, 
where z• = (zn,Zi 2 , … ， Zi n )- 

Consider a particular system of confounding for a p n factorial in blocks of 
size p l . Let a\, cc q -\-\, …， be the partitions corresponding to the 

confounded interactions, where v = (p n ~ £ — l)/(p — 1) and a\, 0 C 2 , • • •, c^q(Q = 
n — i) represent the independent interactions. We then consider the set 


{Xcci mod p: i = 1,2,..., v; X e GF(p ), 人 # 0} 


which we denote by 

G: {aj-J = l,...,p n ~ e -1} 

We note that if we adjoin to G the element oCq = (0, 0,..., 0) we have a group 
of p n ~ l elements of order n — i with a\, « 2 , • •, as the generators. Denote 
this group by G*. 

We now state the following theorem. 

Theorem 11.5 For a given system of confounding for a p n factorial in blocks 
of size p l the composition of the blocks is obtained by using the IBSG together 
with the generators Zj = ccj with ctj G G provided the a j are not contained in 
the IBSG. 


Proof We have to show that each Zj = oc 7 satisfies a different set of equations 
(11.14), that is, with different right-hand sides. 

Suppose that aj and ay(j ★ /) satisfy the same equations. It follows then 
from (11.14) that 

oc\{oij — ctj') = 0 
c^(a 广 oijt) = 0 


a' q (aj - ay) = 0 


that is, cl j — a j ， satisfies (11.14)’. Now a, — a, e G* and, by assumption, the 
only element in G* that satisfies (11.14/ is ocq. Hence = ay. □ 
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Example 11.1 Consider a 3 5 experiment in blocks of size 3 3 . Suppose 
ABCD ， BCDE 2 , and hence AB 2 C 2 D 2 E 2 and AE are confounded with blocks. 
The IBSG is determined by 

Xi + X2 + X3 + X4 = 0 
^2 + ^3 + ^4 + 2^5 = 0 

and it can be seen that = ( 1 , 1 , 1 , 1 , 0 ), a; = ( 0 , 1 , 1 , 1 ， 2 )， 

(1, 2, 2, 2, 2), = (1, 0, 0, 0, 1) do not satisfy these equations. Hence the gen¬ 

erators of the eight blocks besides the IBSG are a’” a’ 2 , a’ 3 , 0 ^ ， a f 5 = 2a\ = 
(2, 2, 2, 2, 0), ^ 6 = 2^ 2 = (0, 2, 2, 2,1,), = 2^ = (2, 1, 1, 1, 1), < = 
= (2,0, 0,0, 2). □ 


11.6 ENUMERATING SYSTEMS OF CONFOUNDING 


Concerning the number of possible systems of confounding, we state the follow¬ 
ing theorem. 


Theorem 11.6 The total number of systems of confounding of a p n factorial 
in blocks of size p l is 


(p n - 1 )(〆 _ p) • • • (p n _ p n - l ~ x ) 

(pn-l _ \)(^pn-i _ p) . . . 、 pn-t — pn-t-l^ 


(11.15) 


Proof It follows from Theorem 11.3 that the total number of systems of 
confounding is the same as the number of distinct sets of n — l independent 
interactions. This number can be obtained as follows: The first interaction can 
be chosen in (p n — 1 )/(/? — 1 ) ways; the second in (p n — p)/(p — 1 ) ways; the 
third in 


p n - p , 丄 ^ 、 p n - p 2 

- -- 1 - 0 ( 2 , p )= -— 

P — \ P — 1 

ways, with c/)(q, p) as defined in ( 10 . 6 ); the fourth in 


pn - p2 --i-H3,p) + H2,p) pn _ p3 


p 


p 


ways; …； the (/:+!) — th(k > 3) in 


P n -P 


k-\ 


P 


1 - 0 (/: ， p)-\-(j){k - 1 , p) 


(11.16) 
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ways, and since 


c/)(k, p) - (j){k -l, p) = p k ~ l - 1 


(11.16) becomes (p n — p k )/{p — 1). Hence the total number of sets generated 
in this way is 




Any one set will, however, have been enumerated in 


r,n—t 







(11.17) 


(11.18) 


different ways, which can be derived in the same way as (11.17). Dividing (11.17) 
by (11.18) gives the stated result (11.15). □ 


11.7 FISHER PLANS 

In many cases (11.15) may be too large to actually write out all possible systems 
of confounding. A question of practical interest then is whether there exists a 
certain type of confounding and how one can find it if it exists. In particular, one 
may be interested to know whether there exists a system of confounding such 
that main effects and low-factor interactions remain unconfounded. 


11.7.1 Existence and Construction 

A first and important result in this direction is the following theorem. 

Theorem 11.7 (Fisher, 1942, 1945) The maximum number of factors with 
p levels that can be accommodated in a design with blocks of size p l {i > 2) 
such that main effects and two-factor interactions remain unconfounded is 

— l 

n = - - (11.19) 

P - 1 

Proof The proof is based on the following correspondence between the 
n = (p 么一 l)/(p — 1) factors F\, F 2 ,, F n each at p levels and the (// — 1)/ 
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(p — 1) effects and interactions of a p l factorial with factors A\, A2,..., A^: 

Fi = A x 
F2 = A2 
F 3 = A 1 A 2 
F4. = A1A2 

F n = A X A P 2 ~ 1 ■ ■ ■ A p ~ l 

or, if we label the factors F by a, where a! = (a\,a2, ..., oti) designates a 
partition in the system, 

with oti G GF(p)( / = 1,2,..., i), not all at = 0 and the first nonzero at equal to 
unity. There are, of course, n such a’s and we denote the set of these a's by A. We 
then create the following two-way table: The rows are labeled by the p l factorial 
combinations x f = (;q ， X 2 ,... ， X() and the columns by the (〆 一 l)/(p — 1) 
interactions A^A^ 2 - - - that is, by the F a . The table entries thus are the inner 
products cc r x mod p [we note that these are the subscripts in the representation 
of r(x) in terms of main effects and interactions]. Each row in this table then 
represents a treatment combination in the n factors F\, F2,... ， F n ， the level 
of F a in the treatment combination z{x) corresponding to row x being a f x. We 
thus have treatment combinations for n factors. These treatment combinations 
constitute the IBSG for the system of confounding to be considered. That this is 
so can be seen as follows: (i) The operation in this set is componentwise addition 
mod p; (ii) the set contains the addition identity; (iii) if the set contains 


z(x)= 


X\,X2,X\ +X2, . . . , Xi + (p 



then it also contains 入 jc ， 入 e GF(p)，since with the p l treatment combination x 
we also have the treatment combination \x because the set of treatment 
combinations forms a group; and (iv) if the set contains 


and 


i 

Z(X) = (x\,X2,Xl +X2, … ，文 1 + (p - 1) 厂 ，) 

i=2 

t 

z(y) = (yuy 2 ,yi + %’… •，: yi + 0- l)^^) 

i=2 
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Each row represents a treatment combination (zi, Z2, Z3, Z4) for the factors 
F\, F 2 , F 3 , F 4 . The nine treatment combinations represent the IBSG for a system 
of confounding for a 3 4 factorial in blocks of size 3 2 . □ 

If we write the IBSG as a w x // array, W say, that is, each column repre¬ 
senting a treatment combination, the proof of Theorem 11.7 leads immediately 
to the following corollary. 

Corollary 11.1 The IBSG for a p n factorial in blocks of size p l , written 
as a matrix W, is an orthogonal array of size p £ , with n = — l)/(p — 1) 


then it also contains z(x) + z(j) because with x and y the treatment combi¬ 
nations also contain x y. 

To show that this IBSG leads to a system of confounding that does not 
confound any of the main effects Fi (i = 1,2 ,... ,n) and any of the 2 -factor 
interactions Fi Fj ,y e FG(p), y ^ 0, i, j = 1,2,... ,n,i ^ j, we have to show 
that in the IBSG (i) each level of each factor F/ occurs the same number of times 
and (ii) each level combination for any two factors Fi, Fj occurs the same num¬ 
ber of times: (1) The level z G GF(p) of factor F a occurs in the IBSG whenever 
u r x = z. Since at least one at # 0 in a, this equation has 〆 — 1 solutions for every 
z G GF(p) and every a e A. (2) The level combination (z\, zi) of factors F a and 
Fp occurs whenever a'x = z\ and p f z = Z 2 . As shown previously (Section 11.2), 
there exist exactly p ^~ 2 such p l treatment combinations x that satisfy these 
equations and that hold for every a, P e A and every z\, Z 2 GF(p). □ 

We illustrate this theorem by the following example. 

Example 11.2 Let p = 3 and 1 = 2, that is, n = A. We set up the table, 
letting A\ = A, A 2 = B, 


Fi 

f 2 

F 3 

f 4 

A 

B 

AB 

AB 2 

x r a f = (1, 0) 

(0,1) 

(1,1) 

(1,2) 
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constraints, p symbols, of strength 2 and index p 1 - 2 , that is, 

OA \p e , ^^ ， p ， 2. ， p e ~ 2 

L p-i 

Such an orthogonal array was called by Rao (1946a) a hypercube of strength 
2. More generally this leads to the following definition. 

Definition 11.1 Let there be n factors each at p levels. A subset consisting 
of 〆 of the possible p n treatment combinations is said to be a hypercube of 
strength d if all combinations of any d of the n factors occur p l ~ d times. Such 
an arrangement is denoted by [n, p, i, d]. 

Still another way of expressing the result of Theorem 11.7 is the following 
corollary. □ 

Corollary 11.2 For a p n factorial experiment there exists a system of con¬ 
founding with main effects and 2-factor interactions unconfounded if the block 
size k is such that 

k > n(p — 1) (11.20) 

For p = 2,3, (11.20) is, of course, the result alluded to in Sections 8.2.3 and 
10 .6, respectively. 

11.7.2 Identifying System of Confounding 

Having constructed a system of confounding using the IBSG of Theorem 11.7, 
the question may then be asked: Which of the interactions in the p n factorial 
are confounded? Let z’ = (zi, Z2, … ，“） denote a treatment combination of the 
p n factorial with the factors F\, F 2 ,..., F n . We then know that the interaction 

Fp - - - Fn n [yi G GF(/?), i = 1,2,... ,n, not all yi = 0, the first nonzero // 

equals unity] is confounded with blocks if and only if 

VIZI + Y2Z2 H - h YnZn = 0 mod p (11.21) 

for every z! in the IBSG. The IBSG contains elements that form an addi¬ 

tive group of order l, that is, the group is generated by i treatment combina¬ 
tions z’ p z’ 2 , .••，<, say with z • = (z/i, z ， 2 , … ， Zin )，/ = 1, 2, … ，& The system 
(11.21) then reduces to 

mil + V2Z12 H - h YnZln = 0 

YIZ21 + Y2Z22 H - h YnZln = 0 

( 11 . 22 ) 

Vizn + Yizn H - h y n Un = 0 
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Equations (11.22) in the unknowns yi, y 2 , •.. ， have p n ~^ solutions. One of 
these solutions is (0, 0, …， 0) so that p n ~ i — 1 nontrivial solutions remain. How¬ 
ever, if y' = (yi, 72, ... ， ） 4 ) is a solution, then 入 〆 with 入 e GF(p) is a solution 
also. Hence there are c = (p n ~^ — l)/(p — 1) distinct solutions y 2 , ... ， y c , 
say, and these then correspond to the confounded interactions E y i , E y2 , … ， E Vc 
in the F \, F 2 ,..., F n system. Of course, of the c interactions only n — i are 
independent interactions. 

To summarize: The general idea is to (i) obtain a set of generators 
z’p z;， …， of the IBSG, (ii) obtain all solutions to (11.22), and (iii) eliminate 
all redundancies from these solutions to obtain c distinct solutions m 2 ，...，JV 

We note that this confounding detection method works, of course, for any 
system of confounding. This is just the formal statement of the procedure men¬ 
tioned in Section 8.4. The method of constructing a confounded design according 
to Theorem 11.7 is just one case in point where the confounded interactions are 
not known a priori. 

11.7.3 Application to Fisher Plans 

Returning now to the case of Theorem 11.7, what can we say specifically about 
yi ， y 2 ,..., y c ? An explicit and complete characterization of the confounded 
interactions can be given as follows. 

Let us return to Eqs. (11.22). We note that the treatment combinations 
z’p z^，...，are generated by the corresponding treatment combinations x f j = 
(xj\,Xj 2 , … ， Xji) (j = 1,2,..., V). Rewriting (11.22) as 

Y\Ci\x\ + ^ - 1 - Yn^ n Xl = 0 

y\d\x2 + Y2^2 x 2 H - h YnCt f n X2 = 0 


y\u\xi + yioi^Xi H - h YnCf- f n xz = 0 

and taking (xi, X 2 , … ， x^) = Ii, we obtain from (11.23) the equations 
anyi -\-oi2\Y2 H - Voi n \y n = 0 

Oi\2Y\ + OC22Y2 H - h OinlYn = 0 

(11.24) 

OiUYl + OL2iY2 H - h OintYn = 0 


which is a set of l equations in the n unknowns yi ， K2,. • • ， K", where aji is the 
zth component of ctj(j = 1 , 2 ,/ = 1 , 2 ,... ， €)• 
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To characterize the solutions to (11.24), we write the a j e A in what we shall 
refer to as standard order in that a’ v cc ’ 2 ,..., represent the l main effects 
Ai, A 2 ,..., respectively, that is, a' v is the vth row in the identity matrix 
Il(v = 1,2,..., i). From (11.24) we then obtain the equations 


~n~ 


Oi£+l,l 

«€+2,l . 



"n+i~ 


Y2 

= - 

叫 1，2 

⑽ +2,2 • 

. • ^n2 


n+2 

(11.25) 

n 



0^i+2,i • 



Yl-\-n 



Since 於 +i ， 於 +2, …， Kn can be chosen freely, we can obtain n — i 
independent solutions to (11.25) and hence n — i independent partitions 
r = {yj = (Yju Yj2, • •. ， Yjn)\ j = by taking successively for 

(n+i, n+2, • •. ， the rows of l n -t ，If we denote the jth row of l n ^n by e'., 
then (11.25) yields successively 

= (11.26) 

j = 1 , 2 ,..., n — i, or if we convert —to positive components mod p, 
(11.26) becomes 


y) = (p - 1) e)) (11.27) 

This is so since in each the first nonzero component is unity and hence 
adding p and multiplying by p — \ yield (11.27). From (11.27) we then have 
that the n — € interactions 


E y J = F\ 








(j = 1 ， 2, …， n — £) are confounded with blocks and so are their GIs, accounting 
for all p n ~^ — 1 d.f. among blocks. 

As an illustration consider the following example. 


Example 113 Suppose p = 2,1 = 3, and hence n = 1. The ct r in stan¬ 
dard order are a\ = (100) ， a; = (010), a’ 3 = (001), a r A = (110) ， a’ 5 = (101 )， 
a r 6 = (Oil), a r 7 = (111). Then from (11.27) y\ = (a» (1101000), / 2 = 
(oi f 5 ,e f 2 ) = (1010100), y f 3 = (a r 6 , e f 3 ) = (0110010), y\ = (a；, ^ 4 ) = (1110001). 
Hence F 1 F 2 F 4 , F 1 F 3 F 5 , F 2 F 3 F 6 , F 1 F 2 F 3 F 7 and all their GIs are 
confounded. □ 


Example 11.4 Suppose p = 3, l = 2, and hence n = 4. We have (see 
Example 11.2) a\ = (10), a; = (01) ， a ’ 3 = (11) ， a: = (12). From (11.27) we 
then obtain = (1120) ， y f 2 = (1202). Hence F 1 F 2 F 3 , F\F^F^ and their GIs 
F\ F4 and F 2 F 3 F 4 are confounded. The reader should verify that this is in 
agreement with the content of the IBSG given in Example 11.2. □ 
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11.8 SYMMETRICAL FACTORIALS AND FINITE GEOMETRIES 

It is of some interest to relate the concepts of a p n factorial experiment and its 
systems of confounding to the notion of finite geometries, for example, Euclidean 
and projective geometries. This line of thought was developed initially by Bose 
(1947b) while deriving some known results in a different way and this then led 
to further extensions. 

The p levels of the factors in a p n factorial experiment may be identified with 
the elements of the GF(p). The treatment combinations (x\,X 2 , … ， x n ) can then 
be thought of as the points of the Euclidean geometry EG(n, p) (see Appendix 
B). Any (n — l)-space in EG(n, p) satisfies the equation 


ao + aixi + 0 ^ 2 X 2 + - 1- a n x n = 0 (11.28) 

Hence the (n — 1)-space consists of p n ~ l points of the EG(n, p). Keeping 
(a\, oil, ..., a n ) fixed and varying ofo ^ GF(/7) yield p distinct (n — l)-spaces 
that together form, of course, the EG(n, p). More generally, a (w — 分） -space of 
the EG(n, p) is determined by q linearly independent equations 

a\o + ol\\x\ + a\ 2 X 2 H - h ot\ n x n = 0 

0^20 + 0i2lX\ + 0^22^2 H - h OL^n^n = 0 

— — 一 — ~ (11.29) 


OiqO + Oi q \X\ + a q 2X2 H - h 0 i qn x n = 0 

which are satisfied by p n ~ q points. 

It is clear that the partitioning of the EG(n, p) into (n — l)-spaces as given by 
(11.28) corresponds to the partitioning we have used earlier to define main effects 
and interactions. We represented such partitions by (ai, o?2, • • •, d n ), and two such 
partitions (a\, a 2 ,..., a n ) and (0i, P 2 , … ， Pn), say, are identical, that is, lead 
to the same set of p distinct (n — l)-spaces of EG(n, p), if (cq ， 0^2,. •. ， = 入 
(卢 1 ， Pi ， … ， Pn) for some 入 e GF(p). We know that the number of distinct par¬ 
titions is (p n — 1)/(/? — 1), which is also the number of points in the projective 
geometry PG(n — l, p) (see Appendix B). It is therefore convenient to identify 
(a\, a 2 ,..., a n ) as a point in PG(n — 1, p). 

With two points a! = (a\,a 2 ,..., a n ) and 〆 = ( 卢1 ，卢 2 ,… D G PG 
(n — 1, p) the points a! + 入於’，入 e GF(p), define a line, that is, a 1-space, in 
PG(n — 1, p). In terms of interactions this means that the points on the line rep¬ 
resent the GIs of E a and E 择 . This implies that for a p n factorial in blocks of 
size p n ~ 2 any two points a ’， 〆 in PG(n — 1, p) determine such a system of con¬ 
founding and the GIs, which are also confounded, are represented by the points 
on the line through a f and • 

This can be generalized easily, and we then have the following: A system of 
confounding for a p n factorial in blocks of size p k is determined by n — k points 
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in PG(n — 1 ， /7) such that the points do not lie on a subspace of PG(n — 1, p) 
of dimension n — k — 2 ox less, that is, the points are not conjoint. 


11.9 PARAMETERIZATION OF TREATMENT RESPONSES 


We have shown in Sections 7.4.2 and 10.3 how the true response of a treatment 
combination for 2 n and 3 n factorials can be expressed in terms of main-effect 
and interaction components. We shall now present such a parameterization for 
the p n factorial. 

In Section 11.2 we have partitioned the p n treatment combinations into p 
mutually exclusive sets of p n ~ l treatment combinations each through Eqs. (11.1), 
that is, 


a\X\ + 0 C 2 X 2 + - V ot n x n = 8 (11.30) 

with 8 e GF(p) and a fixed a' = (a\, a 2 ,..., a n ) with at G GF(p) (/ = 
1,2 ，...， n)a r ^ (0, 0, …， 0). The set of treatment combinations x r = 
(xi, X2 ,..., x n ) that satisfy (11.30) for a given 8 we denoted earlier by S(x; a, 8). 

Let a(x) denote the true response of the treatment combination x. We 
then define 


1 


E s = a(x * ) - 7 E a(x*) 


or, if we write for convenience 




P n 


J2 J2 a( ^ x * ) = M 


8eGF(p) x*eS(x;a,8) 

the overall mean, then (11.31) becomes 


^ = i E **)- 财 

^ V •氺 r CVv « X 、 


A ： *e5(:c;a ， 5) 


(11.31) 


(11.32) 


Since 8 = a f x, we can write (11.32) also as E^, , and we shall use both notations 
interchangeably. We thus have p numbers Eg, E ?， … ， with Eq + -h 
- h = 0. Any linear contrast among these, that is, 

c 0^o c l^l + ... + c p-l^p-\ 

with co + ci + ... + Cp-i = 0 belongs to the interaction E a = A^A^ 2 - - - A^ n . 
We can do this for each admissible partition a, of which there are (p n — 1)/ 
(p — 1). We have denoted this set by A. 

We then have the following theorem. 
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Theorem 11.8 The true response a(x) of a treatment combination x imi p n 
factorial can be parameterized as 

a{x) = M + E K' X (n.33) 

aeA 

Proof We examine the right-hand side (RHS) of (11.33) and show that it 
equals a(x). We have 

RHS = M+ 」一 y" V a(x*) - P " ~ 1 M (11.34) 

Pn-\ — 1 

1 aeA x*eS(x;oc,8) y 

and 

a(x*) = — - ^ . a ⑻ + E ^ a(x*) (11.35) 

oteA x*eS{x\cL,8) ^ aeA x*eS(x;ot,8) 

x*^x 

Now consider a specific x* ^ x. In how many terms of the second expression on 
the RHS of (11.35)，that is, for how many aeA, does a(x*) occur? It occurs if 
and only if 

cc f x = a’x* 

or if 

a / (x-x*) = 0 

or if 

a f u = 0 (11.36) 


for u = x* — x. Now at least one of the components in u is not zero. Hence 
(11.36) as an equation in the unknown ot has p n ~ l solutions. However, we have to 
exclude a’ = (0, 0,. •., 0)，and since with a also 入 a is a solution for 入 e GF(p), 
入 # 0, the number of distinct solutions to (11.36) is (p n ~ l — l)/(p — 1). Using 
this result together with (11.35) in (11.34), we obtain 


RHS = M 



p n - 1 

+ 7^T 


1 

p n — l 


a(x) 



P - 1 


p n - 1 


V* 

X 
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M 


广 1V 厂 ☆ ⑻ 


+ 


P‘ 


n— 1 


1 1 


P - 1 P' 


,n 一 1 


P — I p n 


p n M — a(a;)] 


P n -l , (P n ~ l - Dp\ , (P n — 1) _ (P n ~ l - 1) 


-1 


M 


P 


P 


(p-l)p n ~ l 


a(x) 


a{x) 


□ 


For an illustration of this theorem we refer to Example 10.2. 


11.10 ANALYSIS OF p n FACTORIAL EXPERIMENTS 

We consider a particular system of confounding in 5 types of replicates. These 
s replicates form the basic pattern, which is then repeated q times. The usual 
model for data from such an experiment is 

y = ^ + X p p^ X〆 + X T r + e (11.37) 

where the various terms on the RHS of (11.37) refer to the overall mean, replicate 
effects, block-within-replicate effects, treatment effects, and error, respectively. 

We recall that the blocks within each replicate are determined by the system of 
confounding for that replicate and that the basic pattern of replicates is determined 
by the various types of systems of confounding employed. We shall therefore 
divide the (p n — l)/(p — 1) interactions E a into three mutually exclusive sets 
based on the amount of confounding: 


£1 = 

: \E ak , k = 

1 ,2,.. 

.,ni J: completely confounded 

£2 = 

■- \E y ^, l = 

1,2,.. 

.,J ： partially confounded 

£3 = 

: \E^ m , m - 

= 1,2,. 

..,^3 J: not confounded 


and E y ^ e £2 is confounded in c(y^) replicates and not confounded in u(y^)= 
s — c(y^) replicates of the basic pattern = 1,2,..., n^). Of course, 

打3 = (P n _ l)/(p - 1). 

Just as the interactions in £1 and £2 determine the composition of the blocks 
within replicates, the interaction components , E < ^_ 1 are the 



ANALYSIS OF p n FACTORIAL EXPERIMENTS 


413 


parametric functions of the treatment effects r(x), which are of main interest 
from the analysis point of view. To estimate these quantities, we shall utilize the 
fact that the design under consideration is resolved in that each treatment combi¬ 
nation occurs once in each replicate (we call a design resolved if it is resolvable 
and actually arranged in replicates). Hence, if we arrange the observations y in 
(11.37) according to replicates and if in each replicate we arrange the observa¬ 
tions in the same order according to the p n treatment combinations, we can write 
the design model matrix in (11.37), that is, the matrix 

X = (0,X p ,Xp*, X T ) 


as 


3 3 
0 0 


心*(1) 


X 


P ⑵ 




f(r) 


x c . 




I 

I 

I 


(11.38) 


where each matrix in (11.38) has p n rows and represents the X^* matrix 

for the jth replicate (j = 1, 2,..., r = sq). We now utilize this form of (11.38) 
and the structure of the to proceed with the analysis. 


11.10.1 Intrablock Analysis 

Consider the RNE (1.7) for an incomplete block design 

(rl -- NN f )r = T~- NB (11.39) 

\ k / k 

where N = X r T X^. In our case, because of the form of (11.38), we obtain 


and hence (11.39) becomes 



x， ^Dy = T -\ NB 


(11.40) 
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Let us consider now the right-hand side of (11.40). If we write y f as 
(3W … ，少 k))，where represents the vector of observations in repli¬ 


cate j, then 


B = 


A * ⑴少⑴ 
^*(2)^(2) 


X ^*(r)y(r) 


and hence 

r 

nb = J2 x ^u) x ^u)y(j) 

.7=1 


Furthermore, because of the arrangement of the observations in the same order 
in each replicate, we have 

r 

T = J2yu) 

7=1 

We then write (11.40) as 

Ef 7 -! X^ U) X^ U) )r = E (，4 X ^ j) 〜)))〜） (1L41) 

7=1 V 7 7=1 V 7 

with k = p l . h follows from (11.41) that, in order to analyze the complete data 
set, we can analyze the data in each replicate separately, for example, obtain 
estimates of the main effects and interaction components, and then average the 
information over all contributing replicates. 

Let us then consider the RNE for the jth replicate: 

(/ — iz ro) ^ 0 ,)r=(/-i X ⑽ y (j) (11-42) 

We shall write the equation for treatment combination x as 

市)-去 E 入 = y 0) ⑺ -I ^j(x,x*) y(j)(x*) (11.43) 
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where r(x) and y(j)(x) represent the effect and response of treatment combina¬ 
tion x, respectively, and 

1 if x and x* are in the 
same block in replicate j 

0 otherwise 


Xj(x,x^)= 


Suppose we want to estimate E 言 ， where [see (11.31)] 

^ jc*e5(^ ： ;a,5) xe"X 


(11.44) 


with S(x;a, 8) representing the set of all treatment combinations x satisfying 
the equation a r x = 8 and X being the set of all treatment combinations. We 
distinguish between two cases: 

(i) Suppose E a is not confounded in replicate j, that is, E 01 e £2 or £ 3 . 
Equations (11.44) and (11.42) then suggest that we consider 


,_ 1 — , _ 1 

yu)( x ) - p ⑺ (f) 


where 


or 


E 






r(x*) 




y(j)(x*) (11.45) 


Recall now that each block contains p l treatment combinations, and since 
E a is not confounded in replicate 7 , the p n ~ l treatment combinations satis¬ 
fying a r x = 8 are equally distributed over the p n ~ l blocks. It follows then that 
Y^ x 入 y(x, x*) = p £_1 for each x* e X. Hence, using k = p l , (11.45) becomes 

4 XI 六 ？） = yu)( x ^ ~ (11.46) 

X P X* X P X* 
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Comparing (11.46) to (11.44) yields 

p “笱 u 、= 一)—去 ( H -47) 

尤 P X* 

or the best linear unbiased estimate (BLUE) of from the yth replicate is 

玲 w = 〜 （ n - 48 ) 

’ x 

with y ⑺ being the mean of all the observations in replicate j. 

(ii) Suppose E 01 is confounded in replicate y, that is, E a e 8, \ or £2 - We con¬ 
sider again (11.45). Since E 01 is confounded, the p n ~ l treatment combinations 
satisfying a'x = 8, that is, the treatment combinations in S(x; a, 8), are dis¬ 
tributed over of the p n ~ l blocks each containing of the x*G S(x m , a, 8). 

Hence 

w f p e for x* e S(x;a,S) 

Md*) = 

I [0 otherwise 

It then follows that both sides of (11.45) are identically zero, and hence repli¬ 
cate j does not contribute to the estimation of E 言 ， that is, does not contribute to 
information about E 01 . 

The derivation above holds, of course, for any interaction E a (a e A) and any 
replicate j (j = 1,2,..., r). Specifically for E ak g £1 it follows then that there 
is no intrablock information. For E yi g £2 we obtain information from qu(yi) 
replicates. Let 

I I if E y ^ is not confounded in replicate j 
0 otherwise 

We then obtain from (11.48) 

r r 

p n ~ l ^0) = I] r](y e ,j) [ y(j)(x*)- P n ~ l y U) 

)=1 7=1 _x*eS(x-y e ,8) 


or, if we let 
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then 

砍 = E W〆)- 广 ％) ( 11 獨 

)=1 |_x*e5(A：;y^,5) — 

Finally, information for e S 3 is obtained from all replicates, so that 

E E ^o)(^)-y (11.50) 

)=1 x*eS(x;8 m ,8) 

where y is the overall mean. 

11.10.2 Disconnected Resolved Incomplete Block Designs 

The result just derived is a special case of a more general result applying to dis¬ 
connected resolved incomplete block designs, which we shall give now. Suppose 
we have r replicates with each treatment combination occurring exactly once in 
each replicate. According to (11.38), we write the model for the 7 th replicate as 

y(j) ^ ^ + X-\- It (11.51) 

[the symbol =, used in 1.4.2 as a shorthand for “approximately described by,” is 
used here to denote the expected value, £*(•), of the expression on the left-hand 
side]. Consider now a linear function of the treatment effects, with §’3 = 0. 
Such a function may or may not be estimable, that is, it may or may not be 
confounded with blocks. We state the following definition. 

Definition 11.2 A linear function with §’3 = 0 is completely confounded 
with blocks in the jth replicate of a resolved incomplete block design if 
is a linear function of block totals. □ 

To obtain a characterization of such confounded functions, we multiply (11.51) 
on both sides by 夸 This gives 

^ 0 -) = r^*0) (11.52) 

the left-hand side of which must equal Y ] r for some rj. This implies that 

^ (11.53) 

On the other hand, if (11.53) holds, it follows from (11.52) that 

r]'X p * {j) y U) = V diag{^ ; i,^ 2 , •••, k jb }fi* +^t 
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since 

X 爲 * 、 j) = diag{^ ji , kj2, • •. ， kjb} 

where kji is the size of the zth block in the jth replicate (j = 1, 2,..., r; i = 
1, 2, ...，/?), or with equal block size k, X= kl. Thus we have the 
following theorem. 

Theorem 11.9 A linear function with §’3 = 0 is completely confounded 
with blocks of the jth replicate of a resolved incomplete block design if and only 
if § = for some rj. 

Using this theorem, we can now derive from basic principles of linear model 
theory that, if a linear function is completely confounded in c of the r 
replicates of a design, is estimated in the usual way from the r — c replicates 
in which it is not confounded, thus corroborating our earlier results concerning the 
estimation of E^'s. Suppose then that is completely confounded in replicates 
1, 2, ...， c; that is, we have 

? = = 尤俨 ⑵ "2 = • •. = X^*( c )Tfj c (11.54) 

and let 

r 

与 ’ t = Du 、 

7=1 

Assuming a GMLM (the justification for this is given in 1.6.3.5), we have 

var a ， j a j a e 

7=1 

Hence, to find the BLUE for §’t ，we minimize Y^j=\ a j a j subject to the unbi- 


asedness conditions 





r 

a j = ^ 

7=1 


(11.55) 

and 




^ a i : 

= 0 for j = 1,2,.. 

., c. 

(11.56) 


The condition (11.56) derives from the unbiasedness conditions a f jX 泛 = ♦ 
together with (11.54) which implies 


爸 aj = Vj X(j、cij = Q for y = 1 ， 2， ... ， c 
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The Lagrangian to be minimized is then 

^ = - a’ - 与 + Ed h57 ) 

7=1 \7=1 / 7=1 

where v and 入 j (j. = 1, 2,… ， c) are Lagrangian multipliers. The derivatives of 
(11.57) yield the equations 

aj — v -\- = (f> 0 = 1,2,..., c) (11.58) 

aj — v =0 (j = c + 1, 2,..., r) (11.59) 

r 

J2 a J = ^ 

7=1 

^aj = 0 (7 = 1,2,..., c) 

From (11.58) we obtain 

+ 入 〆 § = (j = 1, 2,..., c) 

which, using (11.56)，reduces to 

入 jXv (y = l,2,...,c) (11.60) 

implying 入 i = = • — = X c = X, say. Now it follows from (11.55), (11.56), 

and (11.59) that 


r 

.Y^ aj =(r — c)^v 

y=c+i 


Hence, from (11.60) and (11.61) 


and 



It follows then from (11.58), (11.59), (11.62), and (11.63) that 


(11.61) 


(11.62) 

(11.63) 


a j = ♦ for y = 1, 2,..., c 
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and 




a j = 

1 

= - 与 for j = c + 1, r 

r — c 

Hence 

fr: 

& c+1 r 〜） 


r — c 


that is, the estimate of linear function that is completely confounded in 
c replicates is the average of the same linear function of the corresponding 
observations from the replicates in which is not confounded. If in particular 
c = r, then is, of course, not estimable. 


11.10.3 Analysis of Variance Tables 

We now return to the analysis of p n factorial experiments in terms of main effects 
and interactions. Since we have used the reparameterization (11.33)，a complete 
set of estimable functions of the r(x) is given by 

{#， 吋'…， 々 2 : 的 e 

and 

( J7^ m 77f 
^0 ， ^ 1 ， • . . ， l 卩 一 2. l ^ I 

accounting for (ji 2 + n^,){p — 1) d.f. among treatments. Hence we have the T|B- 
ANOVA as given in Table 11.1a. It follows from the derivation of E yi and E Sm , 
for example, (11.49) and (11.50), and the general principle (see Section 1.3.7) 

SS(X r \3,X^) = r / Q 


that 


/ 7-1 


SS(E^)= qu( yt ) P n S 


8=0 


(11.64) 


and 


P~ l 

SS (£ 5m ) = rp n ~ l \Es m 
8=0 


(11.65) 


It is easy to verify that 


p-i 


E[SS (f)] = (p- l)cr e 2 + qu( yi )p n - 1 Y, [EVX 
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Table 11.1 T|B-ANOYA for p n Factorial Experiment 


Source 

d.f. 

SS 


a. Basic Partitioning 


x P \o 

sq — \ 

Usual 

1 0 , x p 

sq{p n ~^ — 1) 

Usual 

X Z \3,X P ,X^ 

(«2 + n 3 )(p - 1) 

j2^(E ye )+J2 SS(ESm ' ) 

l m 

Difference 

I\3,X p ,X^,X r 

VR 

Total 

sqp n — 1 

Usual 


b. Partitioning of Block Sum of Squares 

Xp* 1 0 , x p 

sq(p n ~^ — 1) 

Usual 

{E ak } 

n\{p - 1) 

J2 ss ( Eak 、 

b 


ni{p - 1) 

K 

X)SS(£h) c 

Remainder 

Difference (=v) 

i 

Difference 


and 

尸一 1 2 

E SS (E s …) ^ (p - l)a e 2 + rp n - 1 E l' 

5=0 

Since all the comparisons belonging to E 0 ^ e Ei and those comparisons 
belonging to E yi e £2 from replicates in which E y ^ is confounded do not con¬ 
tribute to SS(X r | 0, X p , X^*), they must contribute to J, X p ). This 

then leads to the partitioning of 3, X p ) as given in Table 11.1b, where 

SS(E ak ) c is obtained from all replicates in the usual way [comparable to (11.65)] 
and SS(E yi ) c is obtained from all replicates in which E yi is confounded 
[comparable to (11.64) with substituted for The components 

(E^ k ) c , (E^ k ) c for E ak G £ 1 , E y ^ e £2 and 8 = 0, l,..., p — 2 form a complete 
set of linearly independent functions involving both block effects and treatment 
effects, accounting for +" 2 )(P — 1) d.f. of SS(Z^*| 3, X p ) and constitut¬ 
ing the treatment component of this sum of squares. The remaining comparisons 
belonging to SS(X 0 *| 3, X p ) are then pure block comparisons accounting for 

v = sq(p n 一之 — 1 ) — (ni + ri 2 )(p — 1 ) d.f. ( 11 . 66 ) 


11.11 INTERBLOCK ANALYSIS 

We have shown in the previous section that intrablock information from sev¬ 
eral replicates can be combined additively to obtain the BLUE for an estimable 
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function. We now turn our attention to interblock information for interaction 
components, which means we consider the situation where )8* is a vector of i.i.d. 
random variables with E(P^) = 0 and E(m = Ia^. In particular, we shall 
be concerned about interblock estimates of E ak e £i and E Yi g £2 from those 
replicates in which is confounded. First we shall establish, however, that 
similar to the case of intrablock information the estimates from several replicates 
can be combined additively to yield BLUEs, a result that does not hold generally. 


11.11.1 Combining Interblock Information 

We consider the model equation (11.51) for the observations in the jth replicate, 
assuming without loss of generality 11 = pj = 0, so that we have 

yu) = /t + + e u) (ii.67) 

We can rewrite (11.67) as 

y (j) = It + f (j) (11.68) 

with E(f^) = 0 and 

E (/(J)/(；)) = IU e + X P*U) X 'fi*(j) a l 

= (l + p*X^ U) X^ U) )^ 

=Vj a] (11.69) 

where p* = o • 彦 /o^ and Vj = I ^*()) (y = 1, 2, ..., r). The model 

equation for the combined data from r replicates is thus 

叫 n r /(i)_ 

J(2) 1 f( 2 ) 

= T + (11.70) 

3^)」L’」 

with variance - covariance matrix 


= di ag {V l ,V 2 ,...,V r }cy^ 


( 11 - 71 ) 
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The Aitken equation (see Section 1.4.16.2) then is 


(J/... J 0diag{V7 1 ， Vf 1 ， ... ， V;} . r 


(i r [../)diag{FT 1 ， q 1 ,... ， V7 1 } 


E^ 1 


(11.72) 


and hence 




If is an estimable function, then the BLUE is given by 


= r E E ^o-) 


(11.73) 


Suppose now we consider the estimation of §’*r separately for each replicate. 
It follows from (11.67) that 


so that the BLUE of from replicate j, () say, is 


’ T ) ,• = 6 ⑺ 


(11.74) 


var(^r) =^Vj ^o] 


(11.75) 
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Combining the estimates (11.74) with weights equal to the inverse of the variance 

(11.75)，we obtain 


EUd/^Vj^^r). 


(11.76) 


We know that (11.73) 匕 BLUE for One can see easily that if Vj = V for all 

j = 1, 2,..., r, then of (11.73) equals of (11.76), but in general these 
estimators will be different. We now give a condition under which (11.73) and 

(11.76) are identical, a condition we shall show to be met for the purpose of 
estimating interaction components. 

Theorem 11.10 In a resolved design the interblock information for an 
estimable function obtained from each replicate can be combined additively 
as in (11.76) to yield the BLUE if, for each j, 

Vj § = (11.77) 

that is, if § is an eigenvector of each variance - covariance matrix Vj (j = 
1 ,2,, r), with = 1. 


Proof (i) The estimator in (11.76) is, of course, unbiased. 

(ii) We know that (11.76) is BLUE. Therefore, for (11.76) to be BLUE, we must 
have, using (11.73), 


r 





This must hold for all j()). Hence from (11.78) we must have 

屯 v 1 ) 


q/rw)r 


Now, if (11.77) holds, it follows that 


and hence 


0 + 




与 


(11.78) 


(11.79) 


(11.80) 
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and 



(11.81) 


Using (11.77) ， (11.80)，and (11.81)，on both sides of (11.79) we obtain 



E,dM) 


which completes the proof. 


□ 


11.11.2 Estimating Confounded Interactions 

We now use Theorem 11.10 to obtain interblock information for E ak e £i and 
E yi e £ 2 . Consider an g £1 or £2 and suppose that E a is confounded in 
replicate j. We then consider = E 言 for some 8 e GF(p). Obviously, we 
have §’3 = 0, and it follows from Theorem 11.9 that 


? = (11.82) 

Using (11.82) in (11.69), we obtain 

V j ^ = (I + p*X^ j) X^ U) )^ 

=^ + P* x P*U)) x 'fi*(i)) x P*U) r l 
= 名 + P*kX 浐 (J、rf 

= (l+p*k)^ (11.83) 

since X’ 浐① X = kl ， where k is the block size. Hence the condition of 
Theorem 11.10 is satisfied, which implies that E^ k can be estimated by combining 
additively the estimates from each replicate. Denoting the overall estimate by 
E ^ k , 8 e G¥(p), we then have, using (11.83) with k = p l , 

曹 ( 笱 m= ^r ( 〜 2 + p l °i) ⑴. 84 ) 

Similarly, we obtain E\ l for E yi g £ 2 , 5 g GF(p) from all replicates in which 
E yi is confounded. Hence 
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The and so obtained are, of course, used to yield 

SS(E a ” c = qsp n _ l [印 ] 2 

5=0 

and 

SS (£^) c = qc(Y e )p n ~ l J2 [⑦] 2 
8=0 

of Table 11.1b. Under the assumptions made earlier in this section it follows that 

p—i 

E[SS(E a ^) c ] = (p- l)(a e 2 + p l crj) + qsp n ~ l [ E ?f 

5=0 

and 

p—i 

E[SS(E^) C ] = (p- 1 )( 〜 2 + p l oj) + qc{y t )p n - 1 ^ [E y s e f 

8=0 

Since, in Table 11.1b, 

£ [MS(Remainder)] = + p l cr^ (11.86) 

we have a test for 

H 0 : E: k = E? k = … =E u p k _ x = 0 

with E ak e £i, which was not available before from the intrablock analysis. 

11.12 COMBINED INTRA- AND INTERBLOCK INFORMATION 

To summarize the results so far, we have the following information available for 
E ak e £i ： interblock information only 

E Yi g £ 2 ： intrablock information from qc^y^) replicates and interblock 
information from qc(y^) replicates 
E Sm e £ 3 ： intrablock information only 


We have discussed how to get these types of information in Sections 11.10.1 and 
11.11.2. It remains then to combine the two types of information for g £ 2 . 
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11.12.1 Combined Estimators 

We have seen that for = E y ^ the condition of Theorem 11.10 is satisfied for 
those replicates in which E y ^ is confounded. For those replicates in which E y ^ 
is not confounded, we have = 0 because of unbiasedness, and hence 

so that the condition of Theorem 11.10 is trivially satisfied. It follows then that we 
can combine the estimates from all replicates additively by weighting inversely 
with the respective variances. Thus, letting w = l/o^ 2 and w f = l/(o^ 2 + p l cr^), 

the combined estimator for — E y ^ 1 , say E y ^ — is given by 

p e _ p t 二 u(y e )w (E v s l - El 1 ) + cjy^w' (E y s e - E r s ：) 

8 8, u{y^)w + c{y^w f 

and 

var ( 卽 — E y A = -—— r 

V s / M(y^) + c(y £ )p - 1 ^"- 1 

where p = w/w f . To estimate w and w r (or p), we use the B|T-ANOVA of 
Table 11.2. This yields 

a^^=MS(I\0,X p ,X f) *,X r ) (11.89) 

and one obvious estimator for = 1/w/ is 

(<7》+ 〆#) = = MS (Remainder) (11.90) 

Another estimator for + can be obtained by evaluating 五 [MS(X i g*|3, 

X p , Xr)] in Table 11.2. In order to do that, we first obtain SS vs. E ye ) in 
Table 11.2b. 

Let 


(11.87) 


( 11 . 88 ) 




for E yt e £2 and 5 = 0, 1 ， .•• ， /?—1. Any X y ^ is free of treatment effects but 
is a function of block effects and replicate effects. Hence comparisons among 
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Table 11.2 B|T-ANOVA for p n Factorial Experiment 


Source 

d.f. 

SS 


a. Basic Partitioning 



sq — l 

Usual 

x T \0,x p 

p n - 1 

Usual 

X^\3,X P ,X Z 

sq(p n-t - l)-ni(p - 1) 

Difference 

I\D,X p ,X r ,X^ 

VR 

From Table 11.1 

Total 

sqp n — 1 

Usual 

b. Partitioning of Block Sum of Squares 


x^\o,x p ,x z 

sq(p n ~ l (p - 1) 


vs. 

ni(p - 1) 

See (11.93) 

Remainder 

V 

From Table 11.1 


the X^ £ for fixed y t and 8 = 0, l,..., p — 1 contribute to SS(X^*\0, X p , X r ). 
More specifically we have 


SS (E^t vs. 


- 

-K-l' 

/ 

- 

Y yi n 
_ x p-i 

x^- 


「〜 r 

L O'e J 


- 

.K-2 - 



Y yi 

L X p-2 ' 

-K-!. 


(11-91) 


rYt 


where V yi is the variance - covariance matrix of the vector of comparisons 

/ = 0, 1,— 2. From the form of these comparisons it follows that 


V 


ye 


p' 




and hence 


V 


Ye 


qu(Yt) ciy,,) p 


n-2 


(pi - oo f ) 


We then obtain, from (11.91 )， 


SS(^ vs. ^)= - ⑹， P n —[ g [利 


(11.92) 



COMBINED INTRA- AND INTERBLOCK INFORMATION 


429 


and summing (11.92) over all E y£ e £2 


SS(E^ vs. C ^— 乙咖） c ⑹ 


(11.93) 


Using the definition of X^ 1 and 

E [X v s ll [ = var (Z^) = var (E^ 1 ) + var (£^) 

= P ~ l P ~ l fe + P 1 ^) 

qu(y t ) p n qc(y e ) p n V ” 

it follows that 


E [SS vs. E^)] = (^ - 1) (a e 2 + l ^-p t - ( rj 


Hence 


E[SS vs. £〜)] = (p-l) 


n 2 o^ + !—Y^ u( y t> a j 


From (11.86) and (11.94) it follows then that 

£ [MS (Z^| 3, Z T )] = a ； + Ap^j 


where 


A 


[(p - ^)/s]^ u (yi)^~ v 

sq(p n - 見 —1) — n\(p — 1) 


(11.94) 


(11.95) 


and v as given in (11.66). A second estimator for w/ is then obtained from (11.89) 
and (11.95) as 




2 w f 


K: 




+ -MS(^|3,X p ,Z t ) 


(11.96) 


Using (11.89) and (11.96) in (11.87) yields the combined estimator for E^ 1 — 
El 1 , y = 0, 1 ， 1， (5 # 5'. 
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11.12.2 Variance of Treatment Comparisons 

Recalling the parameterization of the true response of a treatment combination 
as given in (11.33)，we can estimate the difference of the effect of two treatment 
combinations x and z, say, as 


with 


( 11 - 97 ) 

aeA 


var 




h z ) 


2(7： 


u(cc) + c(a)p~ l qp n 


(11.98) 


using the notation of (11.88). In (11.98) u(a), c(a) = 0, l,..., s with u(a) + 
c(a) = s for each a e A, with ^ being the totality of partitions for the p n factorial. 
Since u f x = a!z for (p n - 1 — l)/(p — 1) different a e A, the right-hand side 
of (11.97) contains p n ~ x terms corresponding to those a e A*, say, for which 
a f x ^ a!z. It follows then from (11.97) that 

var(?(x) -f(z)) = J2 ^(K' x -h z ) (11.99) 

aeA* 


with the appropriate expressions from (11.98) substituted in (11.99). Obviously 
A* and hence (11.99) depend on x and z and the particular system of confounding 
that has been used. 

Rather than compute the variance for each treatment comparison separately, it 
may in some situations be appropriate to consider an average variance. To obtain 
the average variance, we consider, for a fixed treatment combination x, T(x) — 
t~{z) for all z ^ x. For a fixed aeA there exist p n ~ l treatment combinations z 
that satisfy the equation 


ai(x\ - z\) + a 2 (X 2 - Z2) H - Voi n (x n - z n ) = ^ 


for 8 e G¥(p). Since there are p — 1 nonzero 8 e GF(p)，it follows that for each 
oc e A there are (p — l)p n ~ l different z’s with a f x ^ cl' z. Hence, using (11.98 )， 


1 

av. var = - 

p n - 1 


^2 var(?(jr) 


= 2 


P~ 1 
p n — l 


aeA 


1 

0(a) + c(a)p~ l )q 


( 11 . 100 ) 
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11.13 THE s n FACTORIAL 

To complete the discussion of the general symmetrical factorial experiment, we 
shall now indicate how the methodology developed for the p n factorial can be 
extended to the s n factorial, where s = p m and p is a prime. One way in which 
this is achieved is by replacing arithmetic in GF(p) for the p n by arithmetic in 
GF(p m ) for the s n factorial. Another way is to treat the (p m ) n factorial as a p mn 
factorial, that is, mn factors with p levels each, these factors being referred to 
as pseudofactors. 


11.13.1 Method of Galois Field Theory 

Suppose we have factors A\, A 2 ,..., A n . We denote a treatment combination 
as usual by x' = (x\, X 2 , ..., x n ) where Xi(i = 1 , 2, ..., n) takes on the values 
0, l,..., s — l. Also, we denote the elements of GF(p m ) by wo, wi, ... ， u s -\ (see 
Appendix A). The partitioning of the s n treatment combinations into s mutually 
exclusive sets of s n ~ l treatment combinations each associated with the interaction 
E 01 = A °^ 2 ... A°^ n is obtained by considering the equations 

^x\ ^o；2 ^X2 + . . . + ^a n ^x n = ^8 (11.101) 


where e G¥{p m ). Each of the s equations (11.101) is satisfied by 一 1 treat¬ 
ment combinations as at least one u ai # wo = 0. Altogether there are (s n — 
1)/(^ — 1) such partitions and hence (s n — 1)/(5- — 1) effects and interactions, 
each accounting for s — 1 d.f. 

We illustrate this procedure in the following example. 


Example 11.5 Take p = 2 , m = 2 , n = 2 , that is, the 4 2 factorial. The ele¬ 
ments of GF(2 2 ) are wo = 0, wi = l, U 2 = x, u^, = x l with P(x) = x 2 x -\- 
1 (see Example A3 in Appendix A). The addition and multiplication tables for 
elements in GF(2 2 ) are 


Addition Multiplication 



Mo 

U\ 

U2 

M 3 


以 0 

U\ 

U2 

«3 

uo 


U\ 

U2 

M3 

uo 

uo 

uo 

wo 

wo 

U\ 


wo 

M3 

U2 

U\ 


U\ 

U2 

M3 

U2 



Mo 

U\ 

U2 



M 3 

U2 

U3 




uo 

W3 




U2 
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Denoting the factors by A and B, we have the following partitions and associated 
interactions: 


Partition 

Equations 

Interaction 

(1,0) 

Uxi — Us 

A 

(0,1) 

^X2 = 

B 

(1,1) 

^x\ H - ^X2 = 

AB 

(1,2) 

+ U2Ux2 = ^-8 

AB 2 

(1,3) 

+ ^3^X2 = ^<5 

AB 3 


with us e GF(2 2 ). Using the addition and multiplication tables, we can determine 
easily for each treatment combination (x\,X 2 ) which equation it satisfies for a 
particular interaction. The results of these computations are given in Table 11.3. 
It follows then, for example, that AB 2 is defined by comparisons among the four 
sets of treatment combinations: 


wo: 

(0, 0 )， 

(2, 1), 

(3,2), 

(1,3) 


Ml ： 

(1,0), 

(3, 1), 

(2, 2), 

(0, 3) 


U2.. 

(2,0), 

(0, 1), 

(1,2), 

(3,3) 


W3: 

(3,0), 

(1,1), 

(0, 2), 

(2,3) 

□ 


Table 11.3 Calculation of Partitioning for Effects and Interactions for 
4 2 Factorial 


Treatment 

Combination 

Effects 


Interactions 

xi 

X2 

A 

B 

AB 

AB 2 

AB 3 

D 

0 

uo 

uo 

uo 

uo 

uo 

1 

0 

U\ 

UO 

U\ 

U\ 

U\ 

2 

0 

U2 

uo 

U2 

U2 

U2 

3 

0 

U2, 

uo 

W 3 

M 3 

M 3 


1 

uo 

U\ 

U\ 

U2 

M3 

1 

1 

U\ 

U\ 

uo 

M 3 

U2 

2 

1 

U2 

U\ 

M 3 

uo 

U\ 

3 

1 

M3 

U\ 

U2 

u\ 

uo 


2 

uo 

U2 

U2 

M 3 

U\ 

1 

2 

U\ 

U2 

M 3 

U2 

uo 

2 

2 

U2 

U2 

uo 

U\ 

M3 

3 

2 

M 3 

U2 

U\ 

wo 

U2 


3 

uo 

M 3 

W 3 

U\ 

U2 

1 

3 

U\ 

M3 

U2 

UO 

M 3 

2 

3 

U2 

M 3 

U\ 

W 3 

uo 

3 

3 

M 3 

M 3 

UQ 

U2 

U\ 
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11.13.2 Systems of Confounding 

The method of constructing systems of confounding can now also be extended 
quite easily. Suppose we consider a s n factorial in blocks of size s i . We choose 
n — i independent interactions, E ai , E ai ,..., E an ~ e say, to be confounded with 
blocks. The s n ~ l blocks are then obtained from the equations 

^an ^x\ + ^a\ 2 ^X 2 + • • . + ^oi\ n ^x n = 

^X\ H - + . . . + ^Ol 2 n^X n = ^82 

. (11.102) 

u a q i u xi + ^a q 2 u X 2 + • • . + u a qn ^x n = ^ 8 q 

where q = n — l, a; = (of/ \ , 啦， ... ，心 )， G GF(^), i = 1,2,... ,q. For a 
fixed set (m^ 19 . •. ， us ) Eqs. (11.102) are satisfied by treatment combina¬ 

tions, which then form a block. 

Further, any treatment combination that satisfies, say, the equations 

^a ， 2 ^X 2 + • • . + ^a n ^x n = 
and 

+ + . • • + U^ n U Xn = U8 2 

also satisfies the equation 

(^ai + + ( u a 2 + u k u @l) U X 2 "I - 

+ = H - ^k u 82 (11.103) 

for any u\ G GF(/? m ). If us { , us 2 now take on all the values in GF(p m ), then the 
right-hand side of (11.103) takes on all values of GF(p m ) exactly s = p m times 
for fixed u\. Hence (11.103) provides a partition of the s n treatment combinations. 
The interaction associated with it is, by analogy to the p n case, referred to as 
a GI of A°[ l ... A^ n and A^ 1 A ^ 2 ... A^ n , denoted by A ^ 1 A ^ 2 ... A^ n with 
(Ki ， K2,... ， )4) to be determined from 

+ W 入 = Wyj 

Wq ；2 + 爲 2 = ^Y 2 

: (11.104) 

+ UXU^ n = Uy n 
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Since u\ G GF(p m ), u\ ^ uo, it follows that A°^ 1 A ^ 2 ... A^ n and A^ 1 A ^ 2 ... A^ n 
have s — l GIs among them. Extending this argument (see Section 11.4) implies 
then that with E ai , E a2 , .. . ， all their GIs are also confounded with blocks, 
accounting for the s n ~^ degrees of freedom among blocks. 

As in the p n case, the composition of the blocks can be obtained by vary¬ 
ing the right-hand side of (11.102). Alternatively, we can obtain the IBSG 
by considering (11.102) with U8 t = uo(i = 1,2,... ,q) and then generate the 
remaining blocks in the familiar way by adding treatment combinations to those 
of the IBSG using GF(p m ) arithmetic. As an illustration, consider the following 
example. 

Example 11.6 Suppose we have a 4 3 factorial in blocks of size 4, that is, p = 
2, m = 2, n = 3, l = 1. We have 16 blocks, that is, we confound five interactions 
with blocks. Suppose we confound AB and BC 2 , that is, a; = (1, 1 ， 0 )，= 
(0, 1, 2). From (11.104) with u\ = u\, U 2 , it follows, using the addition and 
multiplication tables from Example 11.5, that 


U\ + U\U{) 

=U\ 

U\ + U2UQ 

=U\ 

U\ + W3M0 = 

= U\ 

U 1 + M1M1 

= wo 

U\ + U 2 U\ 

=M3 

U\ + U^U\ - 

=U 2 

Uq U\ U 2 

= U2 

Uq + U 2 U 2 

=W3 

UQ + W3W2 = 

=U\ 


and hence AC 2 , A^ 3 C 3 , and AB 2 C are also confounded with blocks. As a 
result, 9 d.f. belonging to 2-factor interactions and 6 d.f. belonging to 3-factor 
interactions are confounded with blocks. The IBSG is obtained by considering 
the equations 

^x\ H - = wo 

W 叉 2 + ^2^X3 = ^0 

They are satisfied by the treatment combinations 

(wo, wo, wo) 〜 （ 0, 0, 0) 

Oi，l/i, i/3) 〜 （1， 1， 3) 

(w 2 , w 2 , wD 〜 （ 2, 2, 1) 

(W 3 , w 3 , m 2 ) 〜 （ 3, 3, 2) 

where the expressions on the right are the (x\,X 2 , ^ 3 ). Another block can be 
constructed by noting that, for example, the treatment combination (u\, M 2 , M 3 ) 
is not in the IBSG. Hence, adding this treatment combination to each in the 
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IBSG, we obtain the block consisting of 

(Wo + Ml, Mo + U2, Uo + W 3 ) = (wi, U2, W 3 ) 〜 (1 ， 2, 3) 

Ol + Ml, Ml + U2, M3 + M3) = (WO, W3, Wo) 〜 （ 0, 3, 0) 

(U2 + Wi, M2 + U2, U\ + W 3 ) = (M 3 , Uq, U2) 〜 (3, 0, 2) 

(M 3 + Ml, M 3 + U2, U2 + W 3 ) = (M2, U\, W!) 〜 (2, 1, 1) 

This process is then continued in the obvious and familiar way. □ 

An equivalent system of confounding using SAS PROC FACTEX is given in 
Table 11.4. It, too, confounds 9 d.f. belonging to 2-factor interactions (as indi¬ 
cated by the Aliasing Structure) and 6 d.f. belonging to the 3-factor interaction. 
We repeat that SAS uses a different parameterization than the one we have used. 

Finally, we mention that Theorem 11.7 holds for the s n factorial also. All we 
need to do is replace p by s = p m and generate the IBSG by using GF(p m ) 
arithmetic. For example, it is possible to accommodate n = 5 factors with four 
levels each in blocks of size 16 without confounding main effects and 2-factor 
interactions. 


11.13.3 Method of Pseudofactors 

Let A\, A 2 ,..., A n again denote the factors in the (p m ) n = s n system. We then 
associate with factor A/ the m pseudofactors Xn, X 。， …， X/ m (i = 1,2,...，"） 
each at p levels. This association implies the following: (i) If at is the level 
of factor Ai in a treatment combination and Xij is the level of factor Xij in a 
treatment combination belonging to the p m system [Xu, X 。， …， X/ m }, we then 
set up the correspondence 


\Cl 1 , CI 2 ? . . . ， a") = (Xi 1 , 又 12， • . . ，， 义 21 ， 义 22， • . • ， *^2m? 


• 5 ^n \ ? ^n2 ， ^nm) 


that is, each level at of the factor A/ corresponds to a level combination 
(Xi ， 々 2, ..., Am) of the pseudofactors Xn, X 。， ... ， Xi m (i = 1,2,, n). (ii) 
The main effects and interactions in the s n system correspond to sets of main 
effects and interactions within and among the n p m systems. More formally 
we express this as follows: Let [E ai , d[ g A } denote the set of main effects 
and interactions associated with the p m system [Xu, X/ 2 , • • •, ^-im) and let 
E ai x E a j ， (a/, oij) g Ai x Aj, represent an interaction among factors in the p m 
systems {Xu, Xn ，..., U and {Xj\, Xj 2 , … ， Xj m } with Ai x Aj denoting 
the totality of the corresponding partitions (a/, ctj). Continuing in this fashion, 
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Table 11.4 4 3 Factorial in Blocks of Size 4 (Confounding 3 d.f. From Each 
2-Factor Interaction and 6 d.f. From the 3-Factor Interaction) 


options nodate pageno=l ； 
proc factex; 
factors A B C/nlev=4 ； 
blocks size=4 ； 
model est=(A B C )； 

examine design confounding aliasing ； 

output out=design blockname=block nvals=(1 23456789 

10 11 12 13 14 15 16); 

titlel 'TABLE 11.4'; 

title2 '4**3 FACTORIAL IN BLOCKS OF SIZE 4'; 

title3 'CONFOUNDING 3 D.F. FROM EACH 2-FACTOR INTERACTION'; 

title4 'AND 6 D.F. FROM THE 3-FACTOR INTERACTION'; 

run ； 

proc print data=design ； 
run ； 


The FACTEX Procedure 
Design Points 

Experiment 

Number ABC Block 


1 

0 

0 

0 

1 

2 

0 

0 

1 

16 

3 

0 

0 

2 

6 

4 

0 

0 

3 

11 

5 

0 

1 

0 

12 

6 

0 

1 

1 

5 

7 

0 

1 

2 

15 

8 

0 

1 

3 

2 

9 

0 

2 

0 

14 

10 

0 

2 

1 

3 

11 

0 

2 

2 

9 

12 

0 

2 

3 

8 

13 

0 

3 

0 

7 

14 

0 

3 

1 

10 

15 

0 

3 

2 

4 

16 

0 

3 

3 

13 

17 

1 

0 

0 

8 

18 

1 

0 

1 

9 

19 

1 

0 

2 

3 

20 

1 

0 

3 

14 

21 

1 

1 

0 

13 

22 

1 

1 

1 

4 

23 

1 

1 

2 

10 
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Table 11.4 (Continued) 


24 

1 

1 

3 

7 

25 

1 

2 

0 

11 

26 

1 

2 

1 

6 

27 

1 

2 

2 

16 

28 

1 

2 

3 

1 

29 

1 

3 

0 

2 

30 

1 

3 

1 

15 

31 

1 

3 

2 

5 

32 

1 

3 

3 

12 

33 

2 

0 

0 

10 

34 

2 

0 

1 

7 

35 

2 

0 

2 

13 

36 

2 

0 

3 

4 

37 

2 

1 

0 

3 

38 

2 

1 

1 

14 

39 

2 

1 

2 

8 

40 

2 

1 

3 

9 

41 

2 

2 

0 

5 

42 

2 

2 

1 

12 

43 

2 

2 

2 

2 

44 

2 

2 

3 

15 

45 

2 

3 

0 

16 

46 

2 

3 

1 

1 

47 

2 

3 

2 

11 

48 

2 

3 

3 

6 

49 

3 

0 

0 

15 

50 

3 

0 

1 

2 

51 

3 

0 

2 

12 

52 

3 

0 

3 

5 

53 

3 

1 

0 

6 

54 

3 

1 

1 

11 

55 

3 

1 

2 

1 

56 

3 

1 

3 

16 

57 

3 

2 

0 

4 

58 

3 

2 

1 

13 

59 

3 

2 

2 

7 

60 

3 

2 

3 

10 

61 

3 

3 

0 

9 

62 

3 

3 

1 

8 

63 

3 

3 

2 

14 

64 

3 

3 

3 

3 


Block Pseudofactor Confounding Rules 

[Bl] = (3*A) + (3*B) + (3*C) 

[B2] = A+(2*B) + (3*C) 
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Table 11.4 (Continued) 


Aliasing Structure 
A 

(2*A) 

(3*A) 

B 

(2*B) 

(3*B) 

C 

(2*C) 

(3*C) 

A + B 

[B] = (2*A) + B 

(3*A) + B 

A + (2*B) 
(2*A) + (2*B) 
[B] = (3*A)+(2*B) 
[B] = A + (3*B) 
(2*A) +(3*B) 
(3*A) + (3*B) 
A + C 

(2*A) + C 

[B] = (3*A) + C 
[B] = A + (2*C) 

(2*A) + (2*C) 
(3*A) + (2*C) 
A + (3*C) 
[B] = (2*A)+(3*C) 
(3*A) + (3*C) 

B + C 
[B] = (2*B) + C 
(3*B) + C 
B + (2*C) 
(2*B) + (2*C) 
[B] = (3*B) +(2*C) 
[B] = B + (3*C) 

(2*B) + (3*C) 
(3*B) + (3*C) 


Obs 


block 


A 


B 


C 
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Table 11.4 (Continued) 


56 

14 

3 

3 

2 

57 

15 

0 

1 

2 

58 

15 

1 

3 

1 

59 

15 

2 

2 

3 

60 

15 

3 

0 

0 

61 

16 

0 

0 

1 

62 

16 

1 

2 

2 

63 

16 

2 

3 

0 

64 

16 

3 

1 

3 


we then have the correspondence 

A, = eAi} 

Ai x Aj = {E ai x E a j ： (oti, ccj) e A[ x Aj] 

At x Aj x Ak = [E ai x E 0Lj x E ak .. au) G A[ x Aj x Ak] 

and so on, up to 

A\ x A 2 x - - - x A n = [Tl x E ai .. (a! ， ci 2 , ..., oi n ) e Yl x Ai] 

This implies, of course, that the s n system can be handled entirely by GF(p) 
arithmetic rather than by GF(p m ) arithmetic. As an illustration, consider the 
following example. 

Example 11.7 Suppose we have a (2 2 ) 3 factorial. For ease of notation we 
use the following correspondence for the factors (A /； Xn, Xi 2 )\ 

4 3 System 2 6 System 

A Xi,X 2 

B Y U Y 2 

C Z 2 

and for the levels (a /； 

4 3 System 2 6 System 

0 0 , 0 

1 1 , 0 

2 0 , 1 
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Table 11.5 Correspondence of Main Effects and 
Interactions for 4 3 and 2 6 System 


4 3 System 

2 6 System 

A 


Xi,Z 2 ,XiZ 2 

B 


Yi,Y 2 ,YiY 2 

C 


Zi, Z 2 , Z 1 Z 2 

A x 

B 

XxYuX^XxY^ 

X2Yi,X2Y2,X 2 Y l Y2 

X l X 2 Y l ,X l X2Y2,X l X 2 Y l Y2 

A x 

C 

ZiZ 1 ,XiZ 2 ,XiZiZ 2 

Z 2 Z 1 ,Z2Z2,X 2 Z 1 Z2 

ZiZ 2 Zi,ZiZ2Z2,XiX 2 ZiZ2 

B x 

C 

FiZi,FiZ 2 , riZ!Z 2 

F2Zi,F2Z2,r 2 Z 1 Z2 

Y l Y 2 Z 1 ,Y l Y2Z2,Y l Y 2 Z l Z2 

Ax B 

x C 

x 1 r 1 z 1 ,x l r 1 Z 2 ,x 1 r 1 z 1 z 2 
X l Y 2 ZuX l Y2Z2,X l Y 2 ZiZ2 
x l Yir 2 z l , x l r l r 2 Z 2 , x 1 r 1 r 2 z l Z 2 



x 1 x 2 rir 2 z 1 , x 1 x 2 rir 2 Z 2 , x 1 x 2 y 1 y 2 z 1 z 2 


The correspondence of effects and interactions in both system is then as given 
in Table 11.5. This representation implies that all main effects and interactions 
in the 4 3 system are expressed as sets of single degrees of freedom contrasts; for 
example, the 3 d.f. for A are represented by the three orthogonal contrasts 


X\ = 

= 0 

vs. 

X\ = l 


X2 = 

= 0 

vs. 

X2 = l 


X\ X2 = 0 

vs. 

Xi X2 = 1 

□ 


The value of the method of pseudofactors just described apart from the reduc¬ 
tion to GF(p) arithmetic lies in its flexibility of constructing systems of con¬ 
founding in blocks of size rather than (p m Y, although it requires a certain 
amount of care in order not to confound contrasts belonging to main effects or 
low-order interactions if that can be avoided. 

Example 11.8 For the 4 3 factorial we may want to consider blocks of size 
8. According to Theorem 11.7，we can find a system of confounding for the 
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2 6 factorial in blocks of size 2 3 without confounding main effects and 2-factor 
interactions. This assures us that no contrasts belonging to A, B, or C in the 
4 3 system have to be confounded. A suitable system of confounding would be 
to confound XiX 2 Y 2 , X x Y x Z u X 2 YiZ 2 and their GIs X 2 YiY 2 Z u XiYiY 2 Z 29 
X 1 X 2 Z 1 Z 2 , Y 2 Z 1 Z 2 . Referring to Table 11.5, this means that we confound 
four contrasts belonging to A x B x C, namely (X\Y\Z\, Z 2 F 1 Z 2 , ^ 2^1 
X 1 Y 1 Y 2 Z 2 ), and one contrast each belonging to A x B, namely (X 1 X 2 Y 2 ), 
A x C, namely (X 1 X 2 Z 1 Z 2 ), and B x C, namely (Y 2 Z 1 Z 2 ). We mention that 
confounding Y 2 Z 1 Z 2 is not necessarily more desirable than confounding, for 
example, Y 2 Z 1 since both belong to the B x C interaction. However, confound¬ 
ing X 1 X 2 , Y 1 Y 2 , Z 1 Z 2 is undesirable. 

It should be obvious how this method of pseudofactors is then used to analyze 
data from such a design. For Example 11.8, with q replicates, we have the T|B- 
ANOVA as given in Table 11.6, where 

_ r 

SS(^^W) = SS(XiFi) + SS(XiF 2 ) + SS(XiyiF 2 ) 


+ SS(X 2 Fi) + SS(Z 2 F 2 ) + SS(X 2 Fiy 2 ) 


+ SS(XiX 2 Fi) + SS(XiX 2 YiY 2 ) 


as follows from Table 11.5 and the fact that X 1 X 2 Y 2 is confounded with blocks. 
The other sums of squares in Table 11.6 are obtained similarly. □ 


Table 11.6 T|B-ANOYA for 4 3 Factorial in Blocks of Size 8 


Source 

d.f. 

SS 

x P \o 

a - 1 

Usual 

X^*\0,x p 

lq 

Usual 

X Z \D,X P ,X P * 

56 


A 

3 

SS(X 1 )-hSS(X 2 )-hSS(X 1 X 2 ) 

B 

3 

ss(yi) + ss(y 2 ) + ss(yiy 2 ) 

Ax B 

8 

SS(X a 1 1 X^ 1 Y{ l Y^) 

C 

3 

SS(Z 1 ) + SS(Z 2 ) + SS(ZiZ 2 ) 

AxC 

8 


B x C 

8 

SS^'Y^zl'Z^ 2 ) 

A x B x C 

23 

^ SS (X^ 1 X2 2 F 2 72 zf 1 z s 2 2 ) 

I\0,X p ,X^*, X T 

56(^ - 1) 

Difference 

Total 

64q — 1 

Usual 
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11.13.4 The (pi x p 2 x ••• x p m ) n Factorial 

The method of pseudofactors can also be used when the number of levels, 
s, is a product of prime numbers, say s = p\p2.. • p m ^ even more gen¬ 
erally, a product of prime powers, say s = p; 1 p& 2 … p^ m • Using pseudofac¬ 
tors then means that the (p^p^ 2 - - - Pm m ) n factorial can be treated by combin¬ 
ing (pi 1 ) , (p^ 2 ) ， … ， (Pm m ) factorials. Rather than discuss the general case, 
we shall describe briefly how to use this method for the 6 〃 factorial, that is, 
s = 6, pi = 2, p2 = 3. 

Denote the factors for the 6 〃 factorial by A\, A 2 ,..., A n . For At introduce 
the pseudofactors Xu, X。, where Xn has p\ =2 levels and X /2 has p 2 = 3 
levels (/ = 1, 2, … ， n). From our discussion in Section 11.13.3 it is clear then 
how we establish a correspondence between the levels of the six-level factor and 
those of the pair of two- and three-level factors: 


6 n System 2 n • 3 n System 

0 0 , 0 

1 1 , 0 

2 0 , 1 

3 1, 1 

4 0, 2 

5 1, 2 

Similarly, the correspondence between main effects and interactions in the two 
systems is exemplified in Table 11.7. 

This correspondence also suggests various possibilities for constructing sys¬ 
tems of confounding. For example, for the 6 2 factorial we may consider blocks of 
size 18 ， 12, 9, 6. The allocation of the treatment combinations to the blocks can 
be achieved easily by using the methods for constructing systems of confounding 
for the 2 2 and/or 3 2 factorials discussed in Chapters 8 and 10. The basic ideas 
are summarized in Table 11.8. Notice that for blocks of size 18 and 9 we use 

confounding in the 2 2 system only, for blocks of size 12 we use the 3 2 system 

only, but for blocks of size 6 we use both the 2 2 and the 3 2 system. 

The idea is, for example for blocks of size 12, to first construct blocks of 
size 3 in the 3 2 system by confounding X 21 and then “augment” those blocks 
by “adding on” to each treatment combination (in the 3 2 system) in a given 
block all the treatment combinations for the 2 2 system (actually, this method 
can be more formally described as the Kronecker product design of a system 
of confounding for the 3 2 factorial with the full 2 2 factorial). If we denote a 
treatment combination by (xn, X 21 , x\ 2 , X 22 ) where x\\ and X 21 refer to the 2 2 
system and x \2 and X 22 to the 3 2 system, then confounding X 21 leads to the 
three “intermediate” blocks given in Table 11.9a. These are augmented to give 
the blocks in Table 11.9b and, using the correspondence set up for the 6 n and 
2 n 3 n systems, the final blocks in the 6 2 system in Table 11.9c. A closer look at 
Table 11.9c reveals the obvious, namely that not all contrasts defining the main 
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Table 11.7 Correspondence Between Main 
Effects and Interactions in 6 n and 2 n 3 n Systems 


6 n System 

d.f. 

2 n 3 n System 

d.f. 

A/ 

5 

Xn 

1 



X i2 

2 



XnX i2 

2 


25 

XnX jl 

1 



XnX j2 

2 



x i2 x n 

2 



Xi 2 X j2 

2 



Xi2X 2 j2 

2 



XnXjiX i2 

2 



XnXjiXj 2 

2 



X n X i2 Xj2 

2 



x n x i2 x 2 j2 

2 



XjiX i2 Xj 2 

2 



XjlX i2 X 2 j2 

2 



XiiXj l Xi 2 Xj2 

2 



x n x n x i2 x 2 j2 

2 


Table 11.8 Systems of Confounding for 6 2 Factorial 


Block Size 

Confound 


In 2 2 3 2 System 

In 6 2 System^ 

18 

^ 11^21 

AiA 2 (1) 

12 

^12 

山⑵ 

9 

Zii,Z 2 i,XiiZ 2 i 

Ai(1),A 2 (1),AiA 2 (1) 

6 

^11^21,^12^22, ^11^12^21^22 

AiA 2 (5) 


a Number in parentheses indicates the number of degrees of freedom confounded from the indicated 
interaction. 


effect A are estimable because they are confounded with blocks. This is also 
indicated in the third column of Table 11.8 ， where, for example, A\{2) means 
that 2 d.f. for main effect A\ are confounded with blocks. 

It is for this reason that this particular system of confounding by itself may 
not be the most useful. A system of partial confounding with X 22 confounded in 
a second replicate solves this problem to some extent. A better method may be, 
however, to confound 2 d.f. from the A 1 A 2 interaction, for example, to construct 
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Table 11.9 6 2 Factorial in Blocks of 
Size 12 


Block 

1 2 3 


a. Intermediate Blocks in 3 2 System 





b. 

Blocks in the 2 2 3 2 Systei 

m 




0 

0 

0 

0 

0 

0 

1 

0 

0 

0 

2 

0 

1 

0 

0 

0 

1 

0 

1 

0 

1 

0 

2 

0 

0 

1 

0 

0 

0 

1 

1 

0 

0 

1 

2 

0 

1 

1 

0 

0 

1 

1 

1 

0 

1 

1 

2 

0 

0 

0 

0 

1 

0 

0 

1 

1 

0 

0 

2 

1 

1 

0 

0 

1 

1 

0 

1 

1 

1 

0 

2 

1 

0 

1 

0 

1 

0 

1 

1 

1 

0 

1 

2 

1 

1 

1 

0 

1 

1 

1 

1 

1 

1 

1 

2 

1 

0 

0 

0 

2 

0 

0 

1 

2 

0 

0 

2 

2 

1 

0 

0 

2 

1 

0 

1 

2 

1 

0 

2 

2 

0 

1 

0 

2 

0 

1 

1 

2 

0 

1 

2 

2 

1 

1 

0 

2 

1 

1 

1 

2 

1 

1 

2 

2 




c. Blocks in the 6 2 System 






0 

0 



2 

0 



4 

0 



1 

0 



3 

0 



5 

0 



0 

1 



2 

1 



4 

1 



1 

1 



3 

1 



5 

1 



0 

2 



2 

2 



4 

2 



1 

2 



3 

2 



5 

2 



0 

3 



2 

3 



4 

3 



1 

3 



3 

3 



5 

3 



0 

4 



2 

4 



4 

4 



1 

4 



3 

4 



5 

4 



0 

5 



2 

5 



4 

5 



1 

5 



3 

5 



5 

5 



the intermediate blocks by confounding X\\X2i (see Table 11.7). Such a method 
will be discussed in Section 11.14. 

The design displayed in Table 11.9 can also be produced using SAS PROC 
FACTEX as given in Table 11.10 

Finally, a method of combining systems of confounding for different compo¬ 
nent systems, as for the 6 2 factorial in blocks of size 6 (see Table 11.8), will be 
described in Section 12.2. 


\ —/ 

0 12 
( 2 ( 2(2 


s. -/ s. -/ 

0 12 
11 11 11 

r\ /I\ /I\ 


0)D2) 
( 0 ( 0(0 
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Table 11.10 6 2 Factorial in Blocks of Size 12 (Using Pseudofactors) 


options nodate pageno=l ； 
proc factex ； 
factors C D/nlev=3 ； 
blocks size=3 ； 
model est=(D C*D )； 
output out=cddesign ； 
run ； 


factors A B ； 
output out=abdesign 
designrep=cddesign ； 
run ； 

data abdesign (drop=A B C D )； 
set abdesign ； 

if A=-l and C=-l then F1=0 ； 
if A=1 and C=-1 then Fl=l ； 
if A=-l and C=0 then Fl=2 ； 
if A=1 and C=0 then Fl=3; 
if A=-l and C=1 then Fl=4; 
if A=1 and C=1 then Fl=5 ； 
if B=-l and D=-l then F2=0; 
if B=1 and D=-1 then F2=l; 
if B=-l and D=0 then F2=2; 
if B=1 and D=0 then F2=3; 
if B=-1 and D=1 then F2=4; 
if B=1 and D=1 then F2=5; 
run ； 

proc print data=abdesign ； 
titlel 'TABLE 11.10'; 

title2 ' 6**2 FACTORIAL IN BLOCKS OF SIZE 12'; 

title3 '(USING PSEUDO-FACTORS)'; 

run ； 


'bs 

BLOCK 

FI 

F2 

1 

1 

0 

0 

2 

1 

0 

1 

3 

1 

1 

0 

4 

1 

1 

1 

5 

1 

0 

2 

6 

1 

0 

3 

7 

1 

1 

2 

8 

1 

1 

3 

9 

1 

0 

4 

10 

1 

0 

5 
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Table 11.10 (Continued) 


11 

1 

1 

4 

12 

1 

1 

5 

13 

2 

4 

0 

14 

2 

4 

1 

15 

2 

5 

0 

16 

2 

5 

1 

17 

2 

4 

2 

18 

2 

4 

3 

19 

2 

5 

2 

20 

2 

5 

3 

21 

2 

4 

4 

22 

2 

4 

5 

23 

2 

5 

4 

24 

2 

5 

5 

25 

3 

2 

0 

26 

3 

2 

1 

27 

3 

3 

0 

28 

3 

3 

1 

29 

3 

2 

2 

30 

3 

2 

3 

31 

3 

3 

2 

32 

3 

3 

3 

33 

3 

2 

4 

34 

3 

2 

5 

35 

3 

3 

4 

36 

3 

3 

5 


11.14 GENERAL METHOD OF CONFOUNDING FOR THE 
SYMMETRICAL FACTORIAL EXPERIMENT 

We have so far considered the s n factorial where s is (i) a prime, (ii) a prime 
power, or (iii) a product of primes. For (i) and (ii) our discussion has centered on 
the fact that the 5^ — 1 d.f. can be partitioned orthogonally into (s n — l)/(s — 1) 
components with s — l d.f. each. We saw that this provides a simple mechanism 
for constructing blocks of size s^(i < n) by confounding some of these compo¬ 
nents with blocks. For (iii) we made use of the same principle and indicated how, 
for example for s = p\pi, blocks of size p\ l p^ 2 (^2, ^2 < can be constructed. 
In all three cases this procedure may not always be entirely satisfactory for two 
reasons: (i) the block sizes are too restricted or (ii) information on important 
effects may have to be sacrificed (due to confounding). The second problem can 
be overcome to some extent by using a system of partial confounding at the cost 
of additional replications. 

The problem of restricted block sizes can often also be overcome by using 
the factorial treatment design together with existing incomplete block designs, 
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specifically BIB and PBIB designs. The drawback with this approach, however, 
is that we are giving up complete control over what information to sacrifice. In 
other words, we may have to give up partial information about many effects, 
including perhaps main effects and low-order interactions. This is expressed in 
reduced efficiency, that is, increased variance, rather than in terms of complete 
loss of certain degrees of freedom for some effects. We shall present an alter¬ 
native method that uses generalized cyclic incomplete block designs (see, e.g .， 
John, 1987) the construction of which attempts to take into account that we may 
want to retain information about main effects but may be willing to sacrifice 
some information about 2-factor interactions, for example, if that is possible at 
all. Before describing the method, however, it is useful to present some ideas 
about a calculus for factorial arrangements as developed by Kurkjian and Zelen 
(1962, 1963). I ' 

11.14.1 Factorial Calculus 

We consider the s n factorial, where 5 is a positive integer. Denote a treatment 
combination by x = (x\, X 2 ,, x n ) r where Xi represents the level of the ith 
factor Ai with 0 < <5 — 1(/ = 1 ， 2, … ， n). Let 汐 =(0, 1,... ， s — 1)’ denote 

the vector of the s levels. Then the symbolic direct product (SDP) 0 ^ 0 is defined 
as (see Kurkjian and Zelen, 1962) 

0^0 = [(0, 0),(0, 1) ， ." ， (0 ，卜 1) ，（ 1 ， 0 )， 

(1 ， 1), . . . , (1, 5 — 1— 1,0), 

0 - 1, 1)， …， 0 - l，s - 1)] (11.105) 

and 0 0 0 as 

0 ^e^0 = [(0, 0, 0),(0, 0, 1),(0, 0, 2),...,(0, 0,5-1),(0, 1,0), 

(0， 1， 1)，... ， (0， 1， s — 1)， …，(夕 — 1， s — 1， s — 1)] 

The array of all s n treatment combinations can then be represented by the n-fold 
SDP 0^ n =0(^)0 0 • • • 0 This will be referred to as the standard arrangement 
(note that this is different from the standard order given in Chapter 7 for the 2 〃 
factorial). Further, let r(x) denote the true effect of the treatment combination 
x and let r be the vector of all treatment effects arranged in standard order. A 
useful reparameterization of r is given by expressing each z(x) in terms of main 
effects, 2-factor interaction effects, and so on. We write 

n n n 

x(x) = Y j ai^i) + 



+ “12"." ( 又 1 ， *^2，• • • ， ^n) 


(11.106) 
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where at (xi ) represents the main effect of factor A/ at level Xi, aij(x“ Xj) rep¬ 
resents the 2-factor interaction effect between factors A/ and A j at levels X[ and 
xj, respectively, and so on. These effects are defined such that 

5—1 

〉: ai(Xi ) = 0 

义 /=0 

5—1 5—1 

〉: j ip^i ，•^ _/•) = 0 = 〉: Cli j {Xi , Xy) 

Xi =0 Xj =0 

that is, every effect sums to zero over the levels of any factor. Note again that 
the reparameterization (11.106) is different from that given in (11.33). Each of 
the effects is expressible as a contrast in the r(x), that is, 

^ 0 (x 1 ， 巧， ... ， x n )z(x\,x 2 , •••,〜） （ 11 . 107 ) 

义 1 X 2 X n 

with Ex! Ex 2 ••- 0(xi, X 2 ,..., x n ) = 0. Following Bose (1947b) a con¬ 

trast of the form (11.107) belongs to the g-factor interaction A/j A/ 2 ... Ai q if 
the 0(^i, X 2 ,, x n ) depend only on xt x ,Xi 2 ,, x/ (1 < i\ < i2 < … < iq S 
n\ \ <q <n). 

Now define z = (z\, Z 2 , • • •, z n ) with Zi = 0, 1 and consider 

a z = a\ l 0 a】 2 ⑭ … 0 < (11.108) 

The af by themselves are abstract quantities [Kurkjian and Zelen (1962) refer to 
them as primitive elements]. It is only in the context of (11.108) that they have 
a quantitative meaning in the following sense. 

In this expression we use the convention that af is omitted if Zi = 0. Then a z 
represents the vector of parameters of the interaction corresponding to the factors 
Ai for which Zi = l, that is, all the terms corresponding to one of the parameters 
in (11.106). Specifically, if zt x = Zi 2 = • - = Z\ q = 1, then a z represents all the 
terms . .. i q (xi l , x/ 2 ,..., Xi q ) of the ^-factor interaction A/ 2 ... A[ q . To 
express these terms as contrasts in the treatment effects, we define the contrast 
matrix 

M = s I (11.109) 


for every i 


for every pair (i ^ j), etc. 


where I is the s x s identity matrix and 3 is an 夕 x 1 vector of unity ele¬ 
ments. With 


M Zi = 



if z/ = 0 
if Q = 1 


( 11 . 110 ) 
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and with x denoting the Kronecker product, 

M z = M Z1 x M Z2 x • • • x M Zn (11.111) 

we have 

a z = —M z t (11.112) 

s n 


To illustrate (11.112), consider, for example, the main effects for factor Ai, 
that is, Z\ = l, Z2 = Z3 = - -= Zn = 0. For the sake of simplicity we shall take 
s = 3 and n = 3. Then (with subscripts indicating the dimensions of a matrix or 
a vector) 

M^ x °27 = M l x M° x M° 

=(3 / - 00 f ) x 3, x 3 ， 

= 3 I 3 X 3g — ^3^27 

^ 20 g 

=- 3 ; 23 ; - 3 “ 

\ — ^9 ~^9 ^9 / 

and 


(100) = 


l ^ t (i, j ， k)——j, k) 

jk ijk 

I f r (2, h k) — ^ j, k) 

jk ijk 

I f t(3, j, k) — — j, k) 

-jk ijk 


■ 7 ( 1 ，.，•）-〒(•，•，.) 
7(2 ， . ， . ） -7(.，.，.) 


using familiar dot notation. Similarly for the main effects for factor A 2 we have 

C = M° X M 1 x M° 

= 3 / x (3 I- 00 f ) x 3 / 

= 33 f x I x3 f -3 f x33 f x0 f 


= 3(/3 X 0 f 3 \l 3 X 0 f 3 \l 3 X ^ 3 ) - 33^27 
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and 


fl_)= r(-,2, 

_7(. ， 3, 

For the interaction between factors A\ and A 2 we obtain 

M ( 9 ^27 = M 1 x M 1 X M° 

=(3 I- 00') X (3 I- 00 f ) X 

= 9/3 x / 3 x 3^ - 3/3 x 33^3 x % - 3 0 3 0 f 3 xlix0 f 3 

+ ^3^3 X 33^3 X 0 f 3 

= 9 / 9 x ^3 - 3 / 3 x -3 a 3 ^3 x / 3 x ^3 + 3 9 3 ’ 27 


and 




T(l ， 1， .） _ "T(l, •，.）— T(. ， 1, .) + T(., •，.) 

7(1 ， 2, •）— T(l，•，.） — T(-, 2, .) + ?"(•，•，.) 
T(l ， 3, .）— T(l，•，.）— 7(. ， 3, .) + ?"(•，.，.) 
1(2, 1， .） — T(2, *, •) — T(*, 1, .) + ?"(•，.，.) 
7(2, 2, •) — T(2, •, •) — T(*, 2, •) + f(-, •, •) 
7(2, 3, .）— T(2, — 7(. ， 3, .) + ?"(•，.，.) 

7(3, 1， •）— T(3, •，•）— T(. ， 1， .）+ 〒(•，•，•) 
〒 (3, 2, .）— T(3, *, •) — T(-, 2, •) + T(*, *, •) 
^(3, 3, •) — T(3, •，.）一 T(. ， 3, •）+ T(.，.，.) 


It is easy to verify that, indeed, 


T(l ， 1 ， .）一 T(l ，•，.）一 T(. ， 1， .）+ T(.，•，.）= “12(1 ， 1) 


for example. Hence, any contrast belonging to the 2-factor interaction A\ x A 2 
can be expressed as 


cD = ^ 110 M (110) r 


where the elements of i\\o add to zero. In general, the number of linearly inde¬ 
pendent contrasts belonging to the effect represented by a z for a fixed z is 
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given by 

n n 

v(z) = rank(M z ) = rank(M Zi ) = — l) Zi 


11.14.2 Orthogonal Factorial Structure (OFS) 

Our aim is to construct suitable systems of confounding for the general s n fac¬ 
torial. In order to do so and to identify such systems that have certain desirable 
properties, it is useful to look ahead at the analysis and deduce from it structural 
properties that must hold for the error-control design and the treatment design. 

Recall (see Chapter 1) that the reduced normal equations for an incomplete 
block design are given by 


Cr= Q (11.113) 

where, for an equireplicate proper design, 

1 , 

C = rl - -NN f (11.114) 

k 

and 

1 , 

Q = T - -NB 
^ k 

The error-control design is determined by the incidence matrix N and the struc¬ 
ture and properties of the design can best be characterized through the form of 
the concordance matrix NN f . All the systems of confounding derived in this and 
previous chapters possess the following two properties: 

1. Each contrast belonging to a certain interaction is estimated orthogonally 

to any other contrast belonging to a different interaction. That is, if for the 
p n factorial [f:。 1 q Ef l and di E^ 2 are two contrasts belonging to 

E 011 and E 012 , respectively, then 

cov ([ c; ff 2 ) = 0 

2. The treatment sum of squares SS(Z T |3, X^) is partitioned orthogonally 
into the individual interaction sums of squares: 

SS^irJ,^) = ^SS(£ a ) (11.115) 

CL 

where SS ( 五 a ) is the sum of squares due to E a and is the sum over 
all oc such that E a is not completely confounded (i.e.，belonging to £2 and 
£3 as defined in Section 11.6). 
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Property 2 follows, of course, from property 1. Factorial designs having these 
properties are said to have orthogonal factorial structure (OFS). The importance 
of OFS is that it simplifies greatly the interpretation of the analysis of a factorial 
experiment. For example, based on significance tests in the ANOVA certain 
interaction terms may be deleted from the model without affecting the estimates 
of other terms remaining in the model. 

We know from earlier discussion (see Chapter 1) that 

SS(Z r |a, = r' Q = x'Ct= Q C CC Q (11.116) 

It is clear from (11.115) and (11.114) that SS(X T |3, Xp) is a function of NN'. 
Now the structure of C is the same as the structure of NN f . We have seen earlier 
(see, e.g., Chapters 2 and 4) that for block designs the structure of C~ is the 
same as that of C. We may then ask the questions: (i) For what structure of 
these matrices do we have an OFS and (ii) how can one exploit that structure 
to generate appropriate systems of confounding? We shall now give answers to 
these questions. 

11.14.3 Systems of Confounding with OFS 

Consider as given in (11.112). It follows then from (11.113) that 

a z = 丄 M z C-g 

s n 

with 

var^) = 
and 

cov(a z ,a w )^ 

for z ^ w. More generally, for two contrasts belonging to a z and a w , respectively, 
we have 

co\[i f z a z , i r w a w ] = cov[€ r z M z< c, i r w M w< c] 

= -^l' z M z C~{M w )' i' w (11.117) 

Furthermore, the sum of squares due to testing the significance of a z is given by 

SS(a z ) = (M z 〒)’ [M Z C-(M Z Y]~ (M z t) (11.118) 

With the reparameterization (11.106) OFS is achieved if expression (11.117) is 
zero and hence analogous to (11.115), 

SS(Z T |3,Z / i) = ^ SS(a z ) 


(11.119) 
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where is the sum over all binary n vectors except (0, 0, ... ， 0). We denote 
the set of all such vectors by Z. Since (11.117) must be equal to zero for all 
choices of i z and l w , the following theorem holds. 

Theorem 11.11 For the s n factorial with parameterization (11.106) a block 
design will have OFS if 

M z C~(M w y = 0 (11.120) 

for all z,w e Z(z ^ w), where M z is defined in (11.111). 

The following theorem, which will be instrumental in constructing block 
designs with OFS, is a special case of a more general theorem due to Cotter, 
John, and Smith (1973). We need to introduce some notation first. For simplicity 
of notation we shall denote the matrix C~, a generalized inverse of C in (11.114), 
by D. We shall refer to a cyclic partition of a matrix H of order m\m 2 x m\m 2 
if H can be written as 

" H X H 2 H 3 H mx 

H H H i H 2 H 

. H 2 H 3 H mi Hi 

where Hi (i = 1,2,..., mi) is of order m 2 x m 2 . We write (11.121) for short as 
H = {H l ,H 2 ,...,H mi } = {(H i )} (11.122) 

We now state the following theorem. 

Theorem 11.12 A block design N for an s n factorial has OFS if the matrix 
C~ = D has the following cyclic partition: 

= { (^iih)} 

^hh = {( 认 ”_“ 3 )} 

Di'i 2 」 n — 2 = { (^iih-in-i)} 

for ij = 1,2,... ,s, j = 1 ， 2,... ， n — 1, and if the 5 1 x 5 matrix has 

row and column sums all equal. 
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Proof The proof is patterned after the results of Cotter, John, and Smith 
(1973) and John and Smith (1972). We show that (11.120) is true by using 
induction. First we show that the theorem is true for n = 2. 

For n = 2 the set Z consists of three vectors: (1,0), (0,1), (1,1). Correspond¬ 
ingly we have, using (11.111 )， 

= (sl s - 0 S 3 ' S ) x J' s 

=Sl s X 3; — X 

= sP\ — 0 S J f s 2 

where, using the notation of (11.122), 

Also 

M( 01) = 3 ' s x (sl s - 0 s 0 f s ) = sP 2 - 0 5 3 ^ 

with 

Pi = (Is, Is, Is) 


and finally 


M (11) = {sl s - 3,3；) x {sl s - 3, 圮） 

= {i s , o — ， 0} - •? {K o,..., o} 

-s{I s , I S ,...,I S } + {H H , U;} 

Now consider 


M (W) D 


=s PiD-0 s 0 f s D 

= s 0 f s , c?2 ,... ， 4 }— 


where di is the row and column sum of D[ in D = {D\, D 2 ,..., D s }. Since 
D3 f M z = 0 for z G Z it follows that 


M m D (M m ) = s {di0 r 5 , d 2 %, ...,d s d' s }{sP 2 - OsO^Y 




= 0 





456 


GENERAL SYMMETRICAL FACTORIAL DESIGN 


Similarly, 

M m DM {n)， =s ㈣ X ， … 乂 3;} M (11) 

= {d\^ r s , di3’ s ， … ， d s 3 f s ] — 5 3 {d\3[ s , di3’ s ， … ， d s 0 f s ] 

- (J2 di) 0 s 3' s2 + s 2 di) 0 S 0' S 2 = 0 

Finally, 

M m D = sP 2 D-3 s 0 ， s2 D 

=J (!>，us 

and hence 

—/ ((E《)n (^>) n …， (E4) 以 ) 

+ ^ 2 ((J>) H g>) K .. ， ([ 今 , 圮 ) 


= 0 

Now assume that the theorem holds for the s w_1 factorial, say with fac¬ 
tors A 2 , A 3 ,A n . If we write the z vector for n — l factors as z* = 
(Z2, Z3,... ， Zn), we then have for z* # # (0, 0,. •. ， 0) 

M z *Di(M w *y = 0 (11.123) 

for i = 1,2,... ,s with the row and column sum of Dj equal to d“ say. Using 
(11.123), we now show that 


M z D(M w Y =0 

for z ^ w. Now M z DM wf can be written as 

(M Z1 x M z *) {Du D 2 ,... ， D s ] (M W1 x M w *)' (11.124) 

To show that (11.124) equals zero, it is sufficient to show that M z *Di(M w *) r = 
0 for every possible z*, in z = (zi, z*) and w = (w\, w^) with z ^ w ^ 
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(0, 0,… ， 0). This can be seen by simply multiplying (11.124) out. We distinguish 
between three cases: 

1. z* = 0* = (0, 0,… ， 0): We then have u;* # (0, 0, •.. ， 0) since otherwise 

z = w. Now, by definition, M°* = J s n-i. Hence = 山 3’ ， and since 

M w * is a contrast matrix, it follows that M z * DiM w * f = 0. 

2. z* = 抝氺 # 0*: We then have z\ 7 ^ w^i and without loss of generality assume 
zi = 0. Then 

M z =0 f s x M z * = {M z \ … ， M z *) 
and consequently 

M Z D = (M z * [ A .，- - MZ * J2 Di ) 

= 0' s 

It follows then that 

M z D(M w y = ^ x J2 MZ * D i) ( MW1 x MW *)' 

= o' s (M wi y x (j2 MZ * D i( MW *y) = 0 

3. z* ^ w^: By assumption the result is true. 

We have therefore shown that if the theorem holds for the s n ~ l factorial it also 
holds for the s n factorial, and we have shown that it holds for the s 2 factorial. 
This completes the proof. □ 

11.14.4 Constructing Systems of Confounding 

We shall now make use of Theorem 11.12 in our attempt to construct useful sys¬ 
tems of confounding for the s n factorial. Obviously, OFS is a desirable property 
of such a system of confounding. The idea then is to construct an incomplete 
block design for the s n factorial such that the matrix C_ = D has the structure 
given in Theorem 11.12. In some sense such an approach may seem to be back¬ 
ward since an incomplete block design is characterized first of all by its incidence 
matrix N and hence, equivalently, by NN f or C. The structure of C then deter¬ 
mines the structure of C_. Now we know that for PBIB designs the structures 
of C and C 一 are the same (see Chapter 4). We therefore want to find a PBIB 
design in which to accommodate the s n factorial such that the C matrix has the 
form specified in Theorem 11.12 for C~, that is, hierarchical cyclic partitions. 
Such a method was proposed by John and Dean (1975) and can be described as 
follows. 



458 


GENERAL SYMMETRICAL FACTORIAL DESIGN 


As before, we denote a treatment combination by x' = (xi, ^ 2 ,..., x n ) where 
0 < Xi < s — l(i = 1,2,, n). The totality of the treatment combinations, X, 
is arranged in lexicographic order, that is，if 0 = (0, 1,.. •, s — 1)’，then 

X = 0 ^)0 ■ ^)0 

(see Section 11.14.1). Following John (1973) we consider the use of generalized 
cyclic PBIB designs, GC/n-PBIB, (see Chapter 5) with blocks of size k. Recall 
that such a design is obtained from an initial block of k treatments. The jth block 
is obtained by adding the jth treatment combination to the treatments in the 
initial block. Here addition of two treatment combinations x r = (xi, X 2 ,..., x n ) 
and y = (yi, • • •, yn) is defined as 

x’ + / = (XI ， +J1,X2, +J2,...+ y n )mod s (11.125) 

Let the highest common factor (HCF) of s and the nonzero elements of x 
be denoted by HCF^y ， x) = h and q = s/h. Then the treatment combinations 
x,2x,... ,{q — 1) x, qx = 0 mod s form a cyclic Abelian group G with addition 
as defined in (11.125). Here, G contains q distinct elements. The implication of 
this with respect to GC/n-PBIB designs is as follows. If 0, x, 2x,..., (q — l)x 
represent the elements of the initial block of size k = q and if additional blocks 
are obtained by adding in turn each treatment of the set X to those in the initial 
block, then the block obtained by adding ux{u < q) contains the same treatments 
as the initial block. Hence the initial block generates a fractional GCIn design 
with s n /q distinct blocks such that each treatment combination occurs exactly 
once. We shall also say that x generates the fractional GC/n design. 

More generally, we can consider g generators X\,X 2 ,... ,x g with correspond¬ 
ing Abelian groups G\, G 2 ,..., G g . For the application to be described, the 
generators will have to be chosen such that the G/ have only the 0 element in 
common. Let HCF (s, Xi) = hi, qi = s/hi and q = qi. We consider then 
the group G formed by taking the direct sum of G\, G 2 ,..., G g , the general 
element being given by 

u\X\ -j- U 2 X 2 H - • • • H - UgXg (11.126) 

with w/ = 0, 1 ， … ，切 一 1, i = 1, 2,..., g. The q elements of G, taken as the 
initial block, then generate a GC/n design with s n /q distinct blocks and every 
treatment combination in X replicated once. To illustrate this concept, consider 
the following example. 

Example 11,9 Suppose we have s = 6, n = 2, that is, the 6 2 factorial. Let 


X ； = (0, 3), 4 = (3,0), 4 = (2, 2) 
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It follows then that 


h\ = 

=HCF [ 6 , (0, 3)] 

= 3, 


h2 = 

=HCF [ 6 , (3, 0)] 

= 3, 

qi 

办 3 = 

=HCF [ 6 , (2, 2)] 

= 2 , 

^3 


and q = 12. The elements of G are obtained from (11.126) with u\ = 0, 1, U 2 = 
0 , 1 , W 3 = 0 , 1 , 2 as 

(0,0) (0,3) (3,0) (3,3) (2,2) (2,5) 

(5,2) (5,5) (4,4) (4,1) (1,4) (1,1) 


This is the initial block, B\ say. The remaining blocks are obtained by adding a 
treatment combination not in B\ to each element in B\. To obtain B 2 , we add 
( 1 , 0 ), which yields 


(1,0) (1,3) (4,0) 

(4, 3) 

(3,2) 

(3,5) 

(0,2) (0,5) (5,4) 

(5,1) 

(2, 4) 

( 2 , 1 ) 

and for B 3 we add ( 0 , 1 ), which yields 




(0,1) (0,4) (3,1) 

(3,4) 

(2,3) 

( 2 , 0 ) 

(5,3) (5,0) (4,5) 

(4,2) 

(1,5) 

( 1 , 2 ) 

The design is summarized in Table 11.11. 



We note here that the initial block, B\ or 

G, acts 

as the IBSG discussed 

earlier. 



□ 


11.14.5 Verifying Orthogonal Factorial Structure 

We shall now show that systems of confounding based on GC/n-PBIB designs 
have OFS. We do this by showing that the C matrix and hence the C~ = D 
matrix have the structure of Theorem 11.12. 

Let R l s denote a.ns x s circulant matrix with a unity element in the ith position 
of the first row and zero elements elsewhere. A circulant matrix 

^ G\ Cl2 ^3 •.. 、 

Cl^ Cl J ... Civ — 1 

A = 

\Cl2 Cl2 CI4 ... a\ J 
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Table 11.11 6 2 in Blocks of Size 12 
(Using GC/n Method) 

Block 1 

Block 2 

Block 3 

0 

0 

1 

0 

0 

1 

0 

3 

1 

3 

0 

4 

3 

0 

4 

0 

3 

1 

3 

3 

4 

3 

3 

4 

2 

2 

3 

2 

2 

3 

2 

5 

3 

5 

2 

0 

5 

2 

0 

2 

5 

3 

5 

5 

0 

5 

5 

0 

4 

4 

5 

4 

4 

5 

4 

1 

5 

1 

4 

2 

1 

4 

2 

4 

1 

5 

1 

1 

2 

1 

1 

2 


can then alternatively be written as 

s s 

A = X at) = ^2 a i Rl s 

i=l i=\ 

As mentioned in Section 5.3, the matrix NN r and hence C for a cyclic PBIB 
design is a cyclic matrix, that is, C can be written as 

t 

c = E c , /?; 

i=l 


For the situation considered here where a treatment consists of an n-tuple (x\,X 2 , 
..., x n ) this expression generalizes to 

i\ = \ i.2=l in = l 



where 


~[ xR'J ^ R^ 1 x R^ 2 x ■■■ x 
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and 



q 

一 1 

c i\h-in ' q 


for i\ = i 2 = ' m ' = in = 1 

for x f = (/i — 1, ••- in — 1) in B\ 

but x’ # (0, 0,... ， 0) 


= 0 otherwise 


The reasons for the specific values of Cia are, of course, (i) we have a single- 
replicate design, that is, r = 1; (ii) the block size k is equal to q = nf=,^；and 
(iii) we have a disconnected design, that is, any two treatments occur together in 
the same block once or not at all. Alternatively, C can be written as 


n 



c = i-~y 

q ^ 

… E 

n ^ 


n 

in 

7=1 

where 




c *.( 

= 1 if = (/i - 

1, h - 

1, ..., - 1) in B\ 


= 0 otherwise 




Now C in (11.127) can be written also (see John, 1973) as 


c = … I ： 



where y = n — 1. If we define 


E c iih-in^ l s 


^n=l 


then (11.129) and hence (11.127) can be rewritten as 


n iv 



(11.128) 


(11.129) 


(11.130) 


(11.131) 


This, however, is just another way of writing the hierarchical cyclic partition 
required in Theorem 11.12. From the definition of C/ u - 2 .../ v in (11.130) it follows 
immediately that the row and column totals for are all equal, namely 

Ci x i 2 ,„i n . It follows then from Theorem 11.12 that the GC/n-generated system 
of confounding has OFS. 
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11.14.6 Identifying Confounded Interactions 

The question remains of which effects a z are confounded with blocks or, more 
precisely, to which effects a z do the b — l = s n /q — 1 block degrees of freedom 
belong? Since we have a disconnected design with b = s n /q blocks it is clear 
that only s n — l — (b — l) degrees of freedom remain for linearly independent 
estimable functions belonging to main effects and interactions. We know from 
our earlier discussion that without confounding the degrees of freedom for a z are 


n 

V{z) = Y\^ s - 1)Zi (11.132) 



We shall denote the corresponding degrees of freedom for a given system of 
confounding by v c (z). We then show the following theorem. 


Theorem 11.13 (John and Dean, 1975) For a GC/^z-generated system of 
confounding the degrees of freedom for the effect a z are given by 


v c (z) = rank(M z ) - X! … I n 

q h in 


Zj 


(11.133) 


where 


Pi 


Zj 


5 — 1 if ij = 1 and Zj 

— 1 if ij ^ 1 and Zj 

1 if zj = 0 


and the c; 




are defined in (11.128). 


Proof Recall from (11.118) that the adjusted sum of squares for a z is given by 


SS(fl z ) = (M z x)\M z C-M z 'r{M z 7) 


John and Dean (1975) show that the degrees of freedom for SS(a z ) are given by 


Mz) 



tr (M Z CM Z， ) 


(11.134) 


where tr( ) denotes the trace of a matrix. Now 

tr(M z CM z O = tr(M z M z/ ) -去 E … 《 fl tt {m_M 叫 

q h in 7=1 
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with 


and 


\x{M z M z， ) =s n 



~[Cy-l 户 



for Zj — 1 
for Zj = 0 


(11.135) 


(11.136) 


Substituting (11.135) and (11.136) into (11.134) and using the fact that 
rank(M z ) = Ylj( s ~ l) z ) prove the theorem. □ 

As an immediate result from Theorem 11.12, we state the following corollary. 


Corollary 11.3 The number of contrasts belonging to a z that are confounded 
with blocks is equal to 


作)=套1：… 

/1 in 



(11.137) 


It follows from (11.137) that (i) v*(z) is determined entirely by the treatment 
combinations in the initial block (ii) if v*(z) = 0, then a z is unconfounded; 
and (iii) if v*(z) = Hj C 5, — l) z; ，then a z is completely confounded. 

We shall illustrate Corollary 11.3 in terms of Example 11.9. 

Example 11.9 (Continued) We need to find Xlf 1=1 X]f 2 =i c hi 2 ^h^h' Using 
the definition of cf“ 2 and B\ given in Table 11.11，the values for p? 1 with 
nonzero coefficients and for z = (1, 0), (0, 1), (1, 1) are given in Table 11.12. 
It follows then that y*= (1,0) = y*(0, 1) = 0 and y*(l, 1) = = 2, that is, 

the main effects are unconfounded and 2 d.f. from the 2-factor interaction are 
confounded with blocks and as a consequence v c (l, 1) = 23. □ 


11.15 CHOICE OF INITIAL BLOCK 

It is worthwhile to compare the designs given in Tables 11.8 and 11.11，which 
were constructed by two different methods. The method of pseudofactors is easy 
to use but results in the confounding of degrees of freedom belonging to the 
main effect A i. The advantage of the GC/2 design is that it leads to confounding 
of degrees of freedom belonging to the interaction A 1 A 2 . This will in general be 
more desirable but not as easy to achieve since the design depends crucially on 
the choice of the initial block. 



464 


GENERAL SYMMETRICAL FACTORIAL DESIGN 


Table 11.12 Determination of for Main Effects and 

Two-Factor Interaction for 6 2 Factorial in Blocks of Size 12 


h, h 

Zl = 1, Z2 = 0 

Z1 = 0, Z2 = 1 

Z1 = 1 ， Z2 = 1 

1,1 

6-1 

6-1 

(6-1)2 

1,4 

6-1 

- 1 

-(6-1) 

4,1 

- 1 

6-1 

-(6-1) 

4,4 

一 1 

-1 

1 

3,3 

-1 

- 1 

1 

3,6 

- 1 

- 1 

1 

6,3 

-1 

- 1 

1 

6, 6 

-1 

- 1 

1 

5,5 

-1 

- 1 

1 

5,2 

-1 

- 1 

1 

2,5 

-1 

-1 

1 

2,2 

- 1 

- 1 

1 

Sum 

0 

0 

24 


How then should one choose the initial block? Two questions need to be 
answered: (i) How many generators does one need? (ii) What should these 
generators be? First of all, only block sizes of the form k = q = are 

permissible, where the qt’s are divisors (not necessarily different) of 5 (including 
s itself). For example, for the 6 2 in blocks of size k = 12, we could have g = 3 
with qi = q 2 = 2, = 3 (see Example 11.9) or g = 2 with q\ — 6, q 2 = 2. 

To answer (ii), one needs to choose the generators X\,X 2 ,... ,x g such that 
the groups Gi, G 2 ,..., G g only have the zero element in common. The choice 
depends further on which effects one is willing to confound to the extent that 
one has a choice. One may have to do this by trial and error, computing for 
each choice of generators v c (z) for each z, and then decide what is the best 
choice for the situation at hand. If one wants to avoid confounding degrees of 
freedom belonging to main effects, one can proceed in the following way (this 
follows from the discussion by John and Dean, 1975). If we write the generators 
as x\ = (x/i, x/ 2 , •.., Xi n )(i = 1 , 2 ,..., g), we can consider the vector of the 
jth components (x\j,X 2 j, …， x g j)(j = 1,2,..., n). If for each j the nonzero 
Xij are relative primes, then no degrees of freedom belonging to main effects 
are confounded. As an example consider for the 6 2 in blocks of size 12 the fol¬ 
lowing two sets of generators: (1) x\ = (1, 5), X 2 = (3, 0) and (2) x\ = (3, 5), 
X 2 = (3, 0), giving, respectively, the initial blocks B \： 

00, 30, 15, 45, 24, 54, 33, 03, 42, 12, 51, 21 

and 

00, 30, 35, 05, 04, 34, 33, 03, 02, 32, 31, 01 
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Using Corollary 11.3 or, alternatively, simple inspection shows that, for system 
1, 2 d.f. from A\A^ are confounded whereas, for system 2, 2 d.f. from A\ are 
confounded. 

We close this section by mentioning that the GCM designs are most valuable 
for the s n factorial where s = p? p^ 2 … p^ m as the method provides some flex¬ 
ibility with block sizes and allows the constructing of systems of confounding 
such that main effects remain unconfounded (if that is possible at all). This will 
play a role when we discuss systems of confounding for asymmetrical factorial 
experiments in Chapter 12. Finally, fors = p and p prime the GCIn designs are 
of the same type as those discussed in Section 11.3. 



CHAPTER 12 


Confounding in Asymmetrical 
Factorial Designs 


12.1 INTRODUCTION 

In previous chapters we have considered systems of confounding for symmetrical 
factorial experiments, that is, the s n experiment in s n ~ l blocks of size s^. In 
many practical situations, however, we may not encounter such a pure system, 
but rather find that different factors have different numbers of levels. We call this 
an asymmetrical or mixed factorial experiment of the form s 1 ^ 1 x s^ 2 x . • • x Sq q , 
where the ith set of factors contains 〜 factors each at Si levels (/ = 1 , 2 ,..., q). 
A typical example is the 2 x 3 2 experiment, that is, one factor having two levels 
and two factors having three levels each. For such an experiment we may want 
to use blocks of size { x s^ 2 x … x , for example, for the 2 x 3 2 exper¬ 
iment we may be interested in finding a suitable arrangement of the treatment 
combinations in blocks of size 2x3 = 6. 

Various methods have been proposed to deal with this situation. Generally 
speaking, they can be divided into the following broad categories: 


1. By looking at each component pure system separately and using the pro¬ 
cedures discussed in previous chapters, it may be possible to generate an 
appropriate design. For example, for the 2 2 x 3 2 factorial experiment, we 
can examine the 2 2 system in blocks of size 2 and the 3 2 system in blocks 
of size 3. Suitable arrangements then lead to blocks of size 6 or 12 for the 
asymmetrical factorial experiment. 

2. One can generalize the formal method for pure systems by defining how to 
combine elements from different finite fields, for example, elements from 
residue fields mod 2 and mod 3. 

3. The construction of suitable PBIB designs can lead to useful systems 
of confounding. We mention here in particular the EGD-PBIB design 
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(Section 4.6.9) as its association scheme is based upon the fact that the 
number of treatments t can be factored in various ways, for example, for 
the 2 x 3 2 factorial t = 2 • 3 • 3. 

4. The factorial treatments can be embedded in a suitable PBIB design, where 
the form of the association scheme suggests a correspondence between the 
treatments in the usual sense and the factorial treatments. 

5. By introducing pseudofactors, we may be able to reduce the asymmetrical 
factorial experiment at least formally to a pure (symmetrical) factorial exper¬ 
iment and then apply the familiar procedures. For example, a 2 3 x 4 factorial 
can be interpreted as a 2 5 factorial. However, care must be taken in that the 
interaction between the last two factors in the 2 5 system is really part of the 
main effect of the factor with four levels (see Section 11.13.3). 

We shall discuss some of these methods in more detail and give suitable 
designs for some that we consider to be the most useful and most commonly 
occurring cases. 

12.2 COMBINING SYMMETRICAL SYSTEMS OF CONFOUNDING 
12.2.1 Construction of Blocks 

To present the basic ideas, let us consider the situation involving only two sets 
of factors, the first set consisting of n\ factors A\, A 2 , .. A ni each at levels 
and the second set consisting of ri 2 factors B\, B 2 , .B n2 each at S 2 levels. 
For the sake of simplicity we shall take ^1 = p\ and 幻 = P 2 , where p\ and p 2 
are different primes, but the results can be generalized easily to the case where 
s\ and/or ^2 are prime powers by using the methods described in Section 11.7. 
We denote a treatment combination by the n\ + 112 vector (x\, X 2 , .. x ni , y\, 
y2, • • y ni )^ where x t = 0, 1 , 2 , ..pi - 1 (/ = 1 , 2 , .. ni), yj = 0, 1 , 2 ,.. 
P 2 - 1 0* = 1, 2, .. n 2 ). 

Suppose then we want to arrange the p 1 ^ treatment combinations in blocks 
of size p\ l < " 1，€2 i "2 with ^1 + ^2 < ^1 + 行 2 ). To this end we consider 

separately the two symmetrical factorial systems, the p[ l factorial in blocks of 
size p\ l , and the p ^ 2 factorial in blocks of size . Using the methods described 
in Chapter 11, we choose n\ — independent interactions to be confounded 
(together with their GIs) in the p? system. Suppose these effects are E ai , E ai , 
.. E av (v = n\ — £\). The arrangement of the p? treatment combinations in 
blocks of size p\ l is then obtained by solving the systems of equations 

a\\x\ + a\2X2 H - + ai ni x ni = 8 \ 

d2\X\ + Oi22X2 H - h a2niX ni = ^2 


H - ^v2^2 + . . . + ^vn\^n\ = 


( 12 . 1 ) 
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for the p v x = 1_£l possible choices of ( 〜， 82 , … ， 5 V ), one for each block. 

Similarly, suppose that the — independent interactions to be confounded 
for the p ^ 2 factorial in blocks of size p ^ 2 are E 择 2 , = n:— 名 2 ). 

The corresponding systems of equations are then 


Pliyi + 卢 1272 H - \- Pln 2 yn 2 = Yl 

P2\y\ + Pliyi H - + P2n 2 yn 2 = Y2 

i (12.2) 

Pjji\y\ + Pjxiyi + . • • -\-P1xn2yn2 = 


We thus obtain blocks for the p? factorial, say Bn, B12, .. B\ p v, and 
blocks for the p ^ 2 factorial system, say B 21 ， B 22 , …, B 2 成 . To obtain the blocks 
for the p n x l x p ^ 2 system, denoted by 5*. (/ = 1 , 2 , ../ 7 j; y = 1 , 2 , .. p&), 
we combine the blocks from the individual systems as follows. To obtain the 
treatment combinations in block Bfj, we “adjoin” to each treatment combination 
(xi ， X2, .. x ni ) in Bu every treatment combination (yi, y2, .. y ni ) i n ^ 2 j to 
yield p \ l treatment combinations of the form (x\, X2, .. x ni , y\, y2 ,.. 
y n2 ). Symbolically this can be expressed as 


B^ = Bu ⑧ B 2j (12.3) 

which has been called the symbolic direct product (SDP) by Kurkjian and 
Zelen (1962). ^ 

Example 12.1 To illustrate the procedure, we consider the 2 3 x 3 2 factorial 
in blocks of size 2x3 = 6, that is, p\ = 2, p 2 = 3, n\ =3， = 2, €1 = 1, 
€2 = 1 - Suppose we choose to confound A 1 A 2 and A\A^ (and hence A 2 A 3 ) and 
B\B 2 , respectively, in the two symmetrical systems. The block compositions are 
determined by the equations 


xx~\~ X 2 = mod 2 

xi + X 3 = 82 mod 2 
and 

^ 1+^2 = Yl mod 3 

respectively. This yields the blocks 

Bn B\2 ^13 ^14 

000 100 010 001 

111 011 101 110 
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and 


B 21 

忍 22 

公 23 

00 

10 

20 

12 

01 

02 

21 

22 

11 


The blocks for the asymmetrical factorial then are given in Table 12.1. The same 
design using SAS PROC FACTEX is given in Table 12.2. □ 

There is another way of representing the result given above. Let x represent 
the column vector of the treatment combinations for the p^ 1 system and y that 
for the p^ 2 system. Further, let N\ and N 2 denote the incidence matrices for 
the two systems of confounding in blocks of size p\ l and p^ 2 , respectively. The 
final design can then be expressed as the Kronecker product 

N = Nix N 2 (12.4) 

where the vector of treatment combinations is given by the SDP x 0 j and the 
blocks are labeled as in (12.3). 

12.2.2 Properties of Kronecker Product Method 

We shall comment briefly on some of the properties of the procedure described 
above. For any interaction confounded in the p? system, say E 01 ，and any inter¬ 
action confounded in the system, say also the interaction E U E^ is 


Table 12.1 Block Arrangement for the 2 3 x 3 2 
Factorial in Blocks of Size 6 


b i*i 

D* 

B 12 

Z? 氺 

万 13 

D 氺 

B 2\ 

D* 

B 22 

D* 

《23 

00000 

00010 

00020 

10000 

10010 

10020 

00012 

00001 

00002 

10012 

10001 

10002 

00021 

00022 

00011 

10021 

10022 

10011 

11100 

11110 

11120 

01100 

01110 

01120 

11112 

11101 

11102 

01112 

01101 

01102 

11121 

11122 

11111 

01121 

01122 

01111 

Z? 氺 

B 31 

D* 

B 32 

D 氺 

^33 

B *» 

B 42 

Z? 氺 

《43 

01000 

01010 

01020 

00100 

00110 

00120 

01012 

01001 

01002 

00112 

00101 

00102 

01021 

01022 

01011 

00121 

00122 

00111 

10100 

10110 

10120 

11000 

11010 

11020 

10112 

10101 

10102 

11012 

11001 

11002 

10121 

10122 

10111 

11021 

11022 

11011 
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Table 12.2 2 3 x 3 2 Factorial in Blocks of Size 6 


options nodate pageno=l ； 

proc factex ； 

factors ABC; 

blocks size=2 ； 

model est=(ABC A*B*C); 

output out=abcdesn ； 

run ； 

factors D E/nlev=3; 
blocks size=3 ； 
model est=(D E )； 
output out=dedesn 
designrep=abcdesn ； 
run ； 


data dedesn(drop=blockl block2 )； 
set dedesn ； 


blockl=block ； 
if blockl=l and 
if blockl=l and 
if blockl=l and 
if blockl=2 and 
if blockl=2 and 
if blockl=2 and 
if blockl=3 and 
if blockl=3 and 
if blockl=3 and 
if blockl=4 and 
if blockl=4 and 
if blockl=4 and 


block2=l 

block2=2 

block2=3 

block2=l 

block2=2 

block2=3 

block2=l 

block2=2 

block2=3 

block2=l 

block2=2 

block2=3 


then block=l ； 
then block=2 ； 
then block=3; 
then block=4 ； 
then block=5; 
then block=6; 
then block=7 ； 
then block=8; 
then block=9; 
then block=10 ； 
then block=ll ； 
then block=12 ； 


run ； 


proc sort data=dedesn ； 

by block ； 

run ； 

proc print data=dedesn ； 
titlel 'TABLE 12.2 '； 

title2 '2**3x3**2 FACTORIAL IN BLOCKS OF SIZE 6'; 
run ； 
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Table 12.2 {Continued) 
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Table 12.2 (Continued) 


43 

8 

-1 

44 

8 

-1 

45 

8 

-1 

46 

8 

1 

47 

8 

1 

48 

8 

1 

49 

9 

-1 

50 

9 

-1 

51 

9 

-1 

52 

9 

1 

53 

9 

1 

54 

9 

1 

55 

10 

-1 

56 

10 

-1 

57 

10 

-1 

58 

10 

1 

59 

10 

1 

60 

10 

1 

61 

11 

-1 

62 

11 

-1 

63 

11 

-1 

64 

11 

1 

65 

11 

1 

66 

11 

1 

67 

12 

-1 

68 

12 

-1 

69 

12 

-1 

70 

12 

1 

71 

12 

1 

72 

12 

1 


1 

1 

1 


1 

1 

1 


1 

1 

1 


1 

1 

1 


1 

1 

1 


-1 

-1 

-1 

1 

1 

1 

-1 

-1 

-1 

1 

1 

1 

-1 

-1 

-1 

1 

1 

1 

-1 

-1 

-1 

1 

1 

1 

-1 

-1 

-1 

1 

1 

1 


0 

1 


0 

1 


0 

1 


0 

1 


0 

1 


0 

1 


0 

1 


0 

1 


0 

1 


0 

1 


confounded. This can be seen easily by referring to Table 12.2. Recall that the 
p\ — I d.f. for E a arise from p\ — 1 linearly independent contrasts among the 
components E^, ..and the p 2 — 1 d.f. for arise from contrasts 

among E^, , .. •， E^_ v The (E? ， entry in Table 12.2 is essentially the 

average of the responses for all treatment combinations in the p ^ 1 p ^ 2 system 
that satisfy the equations 

ot\x\ + (X 2 X 2 H - h = / mod pi 

and 

Piyi + Piyi H - h Pn 2 yn 2 = j mod P 2 

and the (pi - l)(p 2 - 1 ) d.f. for E a E p arise entirely from contrasts among 
the P 1 P 2 entries ( 五 fin Table 12.3 with / = 0 ， 1 ， ...， pi — 1 and j = 0, 
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Table 12.3 Representation of Interaction Contrasts for 
Factorial 



1 ，…， /?2 — 1- This implies then that with this procedure we cannot confound 
interactions between A factors and B factors without confounding at the same 
time also the corresponding A factor interaction and the B factor interaction. In 
terms of Example 12.1 we confound with A\A 2 , AiA 3 , A 2 A 3 , and B\B 2 also 
A 1 A 2 B 1 B 2 , A\A 2 B\B 2 , and 焱 2 焱 3 召 1 召 2 , each accounting for 2 d.f. The possi¬ 
bilities for useful systems of confounding are therefore somewhat limited in that 
one may sometimes be forced to confound low-order interactions or even main 
effects. Partial confounding, if feasible, may resolve this problem to some extent. 

From the construction of the designs it is clear that these systems of con¬ 
founding have OFS as defined in Chapter 11. Expressions for sums of squares 
associated with effects in the pure systems can be written out easily using the 
methods discussed in earlier chapters. The same is true for interaction sums of 
squares involving factors from different pure systems, for example, SS(Ai x B\) 
in Example 12.1, as long as the interaction is not confounded. There do not, 
however, seem to be simple expressions for interaction sums of squares when 
the interactions involve factors from different pure systems and are partially 
confounded, for example, SS(Ai x A 2 x x B 2 ) in Example 12.1. Because 
A\A 2 B\B 2 is confounded with blocks, we can write formally 

SS(Ai x A 2 x B { x B 2 ) = SS(AiA 2 BiB^) 

with 2 d.f. The difficulty in writing out an expression for SS(AiA 2 ^iB|) using 
familiar methods is the fact that this would involve arithmetic with elements 
from different fields. An attempt has been made by White and Hultquist (1965) 
and a more useful formulation will be given in Section 12.4. A general way is, 
of course, to use the least-squares principle and obtain, using the notation of 
Chapter 1.4，directly the partial sum of squares all other 

factorial effects), where X 爲 refers to the observation-block incidence matrix, 
which in this case is the treatment-block incidence matrix. 
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12.2.3 Use of Pseudofactors 

The method of constructing systems of confounding described in Section 12.2.1 
can be extended naturally to include pseudofactors (see Section 11.8). For 
example, this allows us to deal with asymmetrical factorials of the form 
2 n \ x 3 n2 x 4 ^ 3 x 5 n4 x 6 n5 (rii >0 ， / = 1 ， 2 ， 3, 4 ， 5)，which includes most 
practical situations. All we need to do is to rewrite this as 2 ni+2n3 ^~ n5 x 
3 W 2+/15 x 5^4 an( j remember that some interactions in the symmetrical com¬ 
ponents of this modified asymmetrical factorial are possibly main effects in 
the original asymmetrical factorial. To illustrate this, we consider the follow¬ 
ing example. 

Example 12.2 Suppose we want to construct a system of confounding for 
a 2 x 4 x 6 factorial in blocks of size 12 and avoid confounding main effects. 
Denote the three factors by Ai, A 2 , A 3 , respectively. Then we replace A 2 by 
its pseudofactors X\, X 2 each at two levels and A 3 by its pseudofactors Y\ 
with two levels and Y 2 with three levels. We then have four factors at two 
levels each and one factor at three levels, that is, a 2 4 x 3 factorial. In order to 
construct blocks of size 12, we need to confound three contrasts with blocks. To 
avoid confounding main effect contrasts, we need to consider interactions among 
factors from different pure systems. For example, we may confound A\X\, A\Y\ 
and hence X\Y\, which correspond to interaction contrasts belonging to A 1 A 2 , 
A 1 A 3 , and A 2 A 3 . The initial block in terms of the pseudofactors and the original 
factors is then as given below: 


Ai Zi X 2 Yi V 2 Ai A 2 A 3 


0 0 0 0 0 
0 0 0 0 1 
0 0 0 0 2 
0 0 10 0 
0 0 10 1 
0 0 10 2 
110 10 
110 11 
110 12 
11110 
11111 
11112 


0 0 0 

0 0 1 

0 0 2 

0 1 0 

0 1 1 

0 1 2 

1 2 3 

1 2 4 

1 2 5 

1 3 3 

1 3 4 

1 3 5 


The same design (apart from different block numbering) as produced by SAS 
PROC FACTEX is given in Table 12.4. □ 
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Table 12.4 2x4x6 Factorial in Blocks of Size 12 (Using the Method 
of Pseudofactors) 

options ate pageno=l ； 
proc factex ； 
factors A1 XI X2 Y1 ； 
blocks size=4 ； 

model est=(Al XI X2 A1*X2 X1*X2 A1*X1*X2 Y1 Y1*X2 Y1*A1*X1 
Y1*A1*X2 Y1*X1*X2 Y1*A1*X1*X2); 
examine design confounding ； 
output out=twodesn ； 
run ； 

factors Y2/nlev=3; 
output out=threeds 
designrep=twodesn ； 
run ； 


data threeds (drop=Xl X2 Y1 Y2 )； 
set threeds ； 
if Al=-1 then A1=0; 
if Xl=-1 and X2 = -l then A2 = 0; 
if Xl=-1 and X2 = l then A2 = 1 ； 
if Xl=l and X2=-l then A2=2 ； 
if Xl=l and X2 = l then A2 = 3; 
if Yl=-1 and Y2 = -l then A3 = 0; 
if Yl=-1 and Y2 = 0 then A3 = 1; 
if Yl=-1 and Y2=l then A3=2 ； 
if Yl=l and Y2=-l then A3=3; 
if Yl=l and Y2=0 then A3=4 ； 
if Yl=l and Y2 = l then A3 = 5; 
run ； 


proc print data=threeds ； 
titlel 'TABLE 12.4'; 

title2 '2x4x6 FACTORIAL IN BLOCKS OF SIZE 12'; 
title3 '(USING THE METHOD OF PSEUDO-FACTORS)'; 
run ； 


The FACTEX Procedure 


Design Points 

Experiment 

Number A1 XI X2 Y1 Block 


1 -1 -1 -1 -1 4 

2 - 1 - 1-11 1 
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Table 12.4 (Continued) 


3 

4 

5 

6 

7 

8 
9 

10 

11 

12 

13 

14 

15 

16 



1 

1 

1 

1 



1 

1 

1 

1 


1 

1 


1 

1 


1 

1 



1 

1 



4 

1 

3 

2 

3 

2 

2 

3 

2 

3 
1 

4 
1 
4 


Block Pseudofactor Confounding Rules 

[Bl] = X1*Y1 
[B2] = A1*Y1 


Design Points 

Experiment 

Number Y2 


1 -1 

2 0 

3 1 


Obs BLOCK A1 A2 A3 


1 1 

2 1 

3 1 

4 1 

5 1 

6 1 

7 1 

8 1 

9 1 

10 1 

11 1 


0 0 3 

0 0 4 

0 0 5 

0 13 

0 14 

0 15 

12 0 
12 1 
12 2 
13 0 

13 1 
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Table 12.4 (Continued) 
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12.3 THE GC/n METHOD 


The method of using generalized cyclic designs, as described in Section 11.9 for 
symmetrical factorial experiments, was extended to asymmetrical factorials by 
Dean and John (1975). The general ideas are the same, and the statements of 
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theorems and proofs have to be modified in an obvious manner. We shall not go 
into the details but rather give a brief description of the method as it applies to 
factorials of the form s\ x S 2 'x • • • ^ s n , that is, we have n factors Ai, A 2 ,.. 
A n with Ai having Si levels (the Si are not necessarily all distinct). We denote 
the total number of treatment combinations by ， = Yl^i s i- 

12.3.1 Description of the Method 

Let /x be the lowest common multiple (LCM) of ^ 1 , S 2 , .. s n , that is, 

[i = LCM^i, 52 ,..., s n ) (12.5) 

and for a given treatment combination x = (x\,X 2 ,..., x n ) with 0 < < St — 

1(/ = 1,2,... ,n) let 


h 


= HCF 


( \XX\ \1X2 



( 12 . 6 ) 


be the highest common factor of the quantities in parentheses. The treatment 
combination ux = (ux\, UX 2 ,..., ux n ) is obtained by reducing uxi mod Si(i = 1, 
2, .. n). Since /ix/ h = 0, a. cyclic Abelian group G of size / 1 /h is given by the 
treatment combinations 0, x, 2x, . . {fi/h — l)x, and x is called the generator 
of G. In the context of factorials the group G constitutes the initial block, Bi, of 
size k = ji/h. The remaining blocks are then obtained by adding each treatment 
combination to those in B\ and retaining only the set of b = t. h/fi different 
blocks. Each treatment combination will occur in only one block. 

In general, we may have g generators X\,X2, • • • ,x g with corresponding 
Abelian groups G\ ， G2, …， G g having only the zero element in common. Let 


hi = HCFU, 




si 


S2 




where Xij is the level of Aj in Xi(i = 1,2,..., g; j = 1,2,..., n). Further, let 
Q_i = m/ and q = nf=i qi. The group G is then formed by the elements 


U\X\ + U2X2 H - h u g x g {m = 0, 1,..., - 1; / = 1, 2,..., g) 


These q treatment combinations form the initial block B\. Cyclic development 
of B\ as described above yields then a single replicate GCIn design with blocks 
of size k = q and number of blocks b = t/q. 

Extensive tables of GCIn designs and their generators fox n <5, t < 56, k < 
30, Si <7(/ = 1, 2, 3, 4, 5) were given by Dean and John (1975) and additional 
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designs for n < 7, f < 200, k < 100, Si <7(/ = 1 ， 2, ... ， 7) by Lewis (1982). In 
all these designs main effects are unconfounded. We should point out that all the 
designs listed can be easily constructed by the methods described in Section 12.2. 


12.3.2 Choice of Generators 

To illustrate the GCIn method and discuss the possible choice of generators, we 
consider the following example. 

Example 12.3 Suppose we wish to construct a 2 x 5 x 6 factorial in blocks 
of size 10. Now LCM (2, 5,6) = 30. We have k = q = 10 = 2 • 5, suggesting 
that we should have two generators with q\ =2 and q 2 = 5. Now q\ = 30//zi, 
which implies 


hi 


HCF 30, 


30xn 30xi2 30^13 \ 
2 ， 5 ’ 6 ) 


15 


(12.7) 


Since 0 < x\\ < 1, 0 < x \2 < 4, 0 < < 5, it follows that (12.7) will be sat¬ 

isfied by xi = (1 ， 0, 3). Similarly, = 30//^ implies 


h2 = 


HCF 30, 


30^21 30^22 30X23 \ 


= 6 


which leads to X 2 = (0,1, 0) as a possible generator. The initial block B\ and 
the remaining five blocks are given in Table 12.5. □ 


12.3.3 Loss of Degrees of Freedom 

To obtain the loss of degrees of freedom due to confounding for the various main 
effects and interactions a z with z = (zi, Z 2 ,... ， z n ), the expression (11.137) in 
Corollary 11.3 ， 


秦 A 


v 



has to be modified by defining 


Zj 

Pi 


Si 


if ij = 1 and Zj = 1 
if ij ^ 1 and Zj = 1 
if~ = 0 


( 12 . 8 ) 
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Table 12.5 GC/3-Design for 2x5x6 Factorial 
in Blocks of Size 10 


1 

2 

3 

Block 

4 

5 

6 

000 

100 

Oil 

111 

002 

102 

103 

003 

114 

014 

105 

005 

010 

110 

021 

121 

012 

112 

113 

013 

124 

024 

115 

015 

020 

120 

031 

131 

022 

122 

123 

023 

134 

034 

125 

025 

030 

130 

041 

141 

032 

132 

133 

033 

144 

044 

135 

035 

040 

140 

001 

101 

042 

142 

143 

043 

104 

004 

145 

045 


(see Dean and John, 1975). Recall that ^ = 1 for all treatment combinations 
x = (ii — 1, 12 — 1, i n — l) in B\ and zero otherwise. Thus (12.8) depends 
only on the treatment combinations in B\ and hence can be evaluated easily for 
each z e Z. 


Example 12.3 (Continued) The y*(z) follow easily from inspection of the 
design given in Table 12.5. Every level of factor A\ occurs the same number 
of times in each block; hence v*(100) = 0. The same holds true for A 2 ； hence 
v*(010) = 0. For factor A 3 , however, only two levels occur equally often in each 
block, 0 and 3 in B\ and B 2 , 1 and 4 in B 3 and and 2 and 5 in Bs and B^. 
Hence only three contrasts belonging to main effect A 3 can be estimated; hence 
v*(001) = 2 = 5 — 3. For every contrast belonging to a 2-factor interaction each 
level combination has to occur the same number of times. It follows then that 
v*(110) = v*(011) = 0 but y*(101) > 0. To illustrate the method according to 
(12.8), however, we develop the v*(z) in Table 12.6. We obtain y*(001) = = 

2, y*(101) = ^ = 3. These 5 d.f. account then for the 5 d.f. among blocks. 

We note here that the design given in Example 12.3 is slightly better than what 
could have been obtained with the Kronecker product method of Section 12.2. 
Both result in confounding main effects (2 d.f. from A 3 vs. 1 d.f. from A\ and 
2 d.f. from A 3 ), but in this situation that seems to be unavoidable. This shows 
that the GC/n method leads to useful designs not available otherwise. 

For a detailed discussion of other properties of GC/n designs we refer the 
reader to Gupta and Mukerjee (1989). □ 


12.4 METHOD OF FINITE RINGS 

The methods of constructing systems of confounding for symmetrical factorials 
as discussed in Chapters 8 , 9, 10, and 11 (Sections 11.3, 11.7, and 11.8) are based 
on the arithmetic in Galois fields. To transfer these procedures to asymmetrical 



METHOD OF FINITE RINGS 


481 


Table 12.6 Determination of for the Design of Table 12.5 


i\ 

h 

h 




z 




100 

010 

110 

001 

101 

Oil 

111 

1 

1 

1 

1 

4 

4 

5 

5 

20 

20 

2 

1 

4 

-1 

4 

-4 

-1 

1 

-4 

4 

1 

2 

1 

1 

-1 

-1 

5 

5 

-5 

-5 

2 

2 

4 

-1 

-1 

1 

-1 

1 

1 

-1 

1 

3 

1 

1 

-1 

—1 

5 

5 

-5 

-5 

2 

3 

4 

-1 

-1 

1 

-1 

1 

1 

-1 

1 

4 

1 

1 

-1 

—1 

5 

5 

-5 

-5 

2 

4 

4 

-1 

-1 

1 

-1 

1 

1 

-1 

1 

5 

1 

1 

-1 

-1 

5 

5 

-5 

-5 

2 

5 

4 

—1 

-1 

1 

-1 

1 

1 

-1 


Sum 

0 

■ 

0 

20 

30 

0 

0 


factorials involves combining elements from different finite fields. Using results 
from abstract algebra (e.g., van der Waerden, 1966), White and Hultquist (1965) 
and Raktoe (1969, 1970) provided the foundation for generalizing the methods 
for symmetrical factorials to asymmetrical factorials (Hinkelmann, 1997). We 
shall give a brief description of the method (following Raktoe, 1969) without 
going into all mathematical details and proofs. 


12.4.1 Mathematics of Ideals and Rings 

First, we need the following mathematical results. Suppose we have m distinct 
primes p\, p 2 , … ， p m . Let GF(p/) be the Galois field associated with pi(i = 
1 , 2 ,..., m), the elements of which are the residues mod pi. Let q = YIT=\ Pi 
and R(q) be the ring of residue classes mod q. We denote by I(w) an ideal 
generated by an arbitrary element w of R(q). We then state the following results 
(for proofs see Raktoe, 1969): 

1. The element 


a i =n ^ _ p j = c j ~ p j wi* Cj = ]~[ Pi 

i=l i 參 j 

i^j 

in R(q) is prime to q and hence aj l exists. The aj’s belong to the multi¬ 
plicative group of nonzero divisors in R(q). 

2. The element bj = CjaJ 1 generates the ideal I(bj) in R(q). 

3. The element bj is the multiplicative identity in I (bj). 

4. The multiplicative identity element 1 in R(q) is the sum of the multiplicative 
identities in I (bj), that is, 1 = bj. 
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5. The ring R(q) is the direct sum of ideals I (bj )，that is, R{q) = 

6 . The field GF(pj) is isomorphic to the ideal I(bj); for x G GF(pj) and y e 
I(bj) the mapping a: GF(pj) /(Z?y) is defined by a{x) = bjX = y. 

7. Addition and multiplication of elements from different Galois fields x e 
GF(pj), x* e GF(pi) are defined by 

x + x* = <j(x) + mod q 

xx* = cr(x)cr(x*) mod q 

8 . Addition and multiplication of an element r g R(q) and an element x e 
GF(pj) are defined by 

r + x = r + a (x) mod q 
rx = rcr(x) mod q 

9. The ring R(q) is the direct sum of the GF(pj), that is, R(q) = Y^j=\ ^ 

GF(pj). 1 — 

To illustrate these results, we consider the following example. 

Example 12.4 Suppose we have p\ = 2, p 2 = 3. Then p = 6 and the gen¬ 
erators of the ideals are 

bi = 3(3 - 2 ) _1 =3.1 = 3 mod 6 

and 

b 2 = 2(2 - 3 ) _1 = 2(5 ) _1 =2-5=4 mod 6 
[(5 ) _1 = 5 since 5 • 5 = 25 = 1 mod 6 ]. We thus have the two mappings 


GF ⑵ — 7(3) and GF(3) ^ /(4) 



and the direct sums 

GF(2) 0 GF(3) = 7(3) 0/(4) = R(6) 

0 + 0 = a(0) + r(0) = 0 

1 + 0 = a(l) + r(0) = 3 

0+1 = a(0) + r(l) = 4 

1+1 = a(l) + r(l) = 1 

0 + 2 = a(0) + r(2) = 2 

1 + 2 = a(l) + r(2) = 5 


□ 
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12.4.2 Treatment Representations 

We now apply these concepts and results to asymmetrical factorials of the form 
p n x l x p ; 2 x •.. x pL m . For ease of notation we shall consider specifically the 
2 2 x 3 2 factorial. Generalizations will then become obvious. 

Ordinarily, treatment combinations would be represented in the form of 
quadruples, say x = (xu,x\ 2 , ^ 21 ^ 22 ). where xn,x \2 refer to the levels of the 
two-level factors, A\, A 2 say, and X 21 , X 22 refer to the levels of the three- 
level factors, B\, B 2 , say, that is, xu = 0, 1, X 2 i =0, 1, 2. We shall refer to 
this as the elementary representation of treatment combinations and denote it by 
T (x), remembering that the components in x are elements from the respective 
Galois fields. In order to perform arithmetic with treatment combinations, we now 
replace T(x) by a new representation T (v), where the components in v = (v\\, 
v\ 2 , V 21 , V 22 ) represent the corresponding elements from the respective ideals, 
that is, v\\, v\2 G 7(3), V21, V22 ^ /(4). The correspondence is as follows: 


T(x) T(v) 

XU X\2 X21 X 2 2 Vn Vn V 2 1 V 2 2 


0 0 0 0 
0 10 0 
10 0 0 
110 0 
0 0 0 1 
0 0 0 2 
0 0 10 


0 0 0 0 

0 3 0 0 

3 0 0 0 

3 3 0 0 

0 0 0 4 

0 0 0 2 

0 0 4 0 


2 2 3 3 2 2 


12.4.3 Representation of Main Effects and Interactions 

Concerning the representation of main effects and interactions, we recall that 
for a p n factorial an effect E a is represented by contrasts among the “levels” 
五二 —i of E a . Here the components in a’ = (ai, ai, .. ot n ) also 
take on the values 0 ， 1 , .. p — 1 with the understanding that the first nonzero 
a[ equals 1. We note also that for the p n factorial we can write an interaction 
in general as A\ l x A ^ 2 x •.. x A^ n , where the n are either 0 or 1 with the 
understanding that if zi = 0 then the letter A/ is dropped from the interaction. 
An interaction A/j x A / 2 x • • • x consists then of (p — l/ 一 1 components 
A/j A ^ 2 - - - where the at take on all possible values between 1 and p — l. 
Each such component represents p — 1 d.f. 

To generalize this representation for asymmetrical factorials, we now write an 
interaction for the 2 2 x 3 2 factorial as Aj 11 x A^ 12 x B^ 21 x B^ 22 , where and 
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z * 2 are either 0 or 3, that is, elements in 1(3), and and are either 0 or 4, 
that is, elements in 1(4), with the understanding that if z*j = 0 then that factor is 
dropped from the interaction. We may have, for example, x x which 
refers to the interaction among the factors A\, B\, and B 2 . The components of 
this interaction are, in analogy to the symmetrical component systems, given by 
Aj x x ^2 = The number of “levels，” L say, for these 

interaction components is determined by the fact that they involve factors with 
two and three levels. The L is equal to the number of residue classes for the direct 
sum of the ideals corresponding to the factors involved. In our case this is the 
number of residue classes for 7(3) (g) 7(4) = R( 6 ), that is, L = 6. More generally, 
if the interaction involves only i factors out of m, say FFi 2 , .. •， (l < m), 
then L is equal to the number of residue classes in ( 8 ) I(bi 2 ) 0 • • • 0 I(bi t ) 
that is generated by the greatest common divisor of bi x , bi 2 ， … ， bi r Trivially, 
this includes the main effects, that is, for main effect F[ with pi levels, L equals 
the number of residue classes for I(bi), which equals of course pi. 

The reason why we mention the levels of an interaction is that the total number 
of treatment combinations can be partitioned into L equal-sized sets (of treatment 
combinations) that can be identified with blocks and a recognizable system of 
confounding. Let us consider the 2 2 x 3 2 factorial. For it we have for the various 
main effects and interactions the following L’s: 

L{A\) = L{A\) = L{A\a\) = 2 

L(B\) = L(B^) = = L(S/b|) = 3 

L{A\b\) = L(A\b^) = … =L(Af Af )= … == 6 

where (p\, q> 2 , € /(3), rn, r ]2 e 7(4). For for example, this leads to 

the partitioning of the 36 treatment combinations (v\\, v\ 2 , V 21 , V 22 ) into six sets 
according to the equations. 

3v\\ + 3v\2 + 4i ； 2i + 2 i ；22 = ^ (12.9) 

where 5 = 0 ， 1 ， 2 ， 3, 4 ， 5， that is, 8 e R( 6 ). The sets are given in Table 12.7. 
Each set represents a block of size 6 . We note that the set 5 = 0 represents 
the IBSG and all other sets can be obtained easily by adding to each treatment 
combination in the IBSG a treatment not in the IBSG and not in any other 
already constructed set. As a consequence of constructing blocks according to 
(12.9), it follows that the interaction associated with (12.9), that is, 
is confounded with blocks. This statement, however, is to be understood only 
in the sense that together with also and that is, the 

components from the symmetrical systems, are confounded with blocks. This is 
so because to satisfy (12.9) for a given 8 we must have 


3v\\ + 3v\2 = S\ mod 6 


(12.10) 
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Table 12.7 Sets of Treatment Combinations 
Satisfying 3v\\ + 3v\2 + 4i ； 2i + 2v22 = 5 


8 

= 

0 



• — 

= : 

l 

8 

I = 

= 2 

0 

0 

0 

0 

3 

0 

4 

0 

0 

0 

2 

0 

3 

3 

0 

0 

0 

3 

4 

0 

3 

3 

2 

0 

0 

0 

4 

4 

3 

0 

2 

4 

0 

0 

0 

4 

3 

3 

4 

4 

0 

3 

2 

4 

3 

3 

0 

4 

0 

0 

2 

2 

3 

0 

0 

2 

0 

0 

4 

2 

3 

3 

2 

2 

0 

3 

0 

2 

3 

3 

4 

2 

8 = 3 8 = 4 8 = 5 

3 

0 

0 

0 

0 

0 

4 

0 

3 

0 

2 

0 

0 

3 

0 

0 

3 

3 

4 

0 

0 

3 

2 

0 

3 

0 

4 

4 

0 

0 

2 

4 

3 

0 

0 

4 

0 

3 

4 

4 

3 

3 

2 

4 

0 

3 

0 

4 

3 

0 

2 

2 

0 

0 

0 

2 

3 

0 

4 

2 

0 

3 

2 

2 

3 

3 

0 

2 

0 

3 

4 

2 


and 


4 v 2 \ + 2 v 22 = ^2 mod 6 


( 12 . 11 ) 


with 8 \ 82 = 8 . Thus we see that the system of confounding generated in this 

manner is the same as that obtained by using the Kronecker product method of 
Section 12.2 by considering confounding in the 2 2 and 3 2 systems separately 
and then combining them. This is true in general (see Voss, 1986) and thus may 
detract somewhat from the value of this method, but we shall discuss below that 
it provides for an extension of a useful result for symmetrical factorials that other 
methods do not do. 


12.4.4 Parameterization of Treatment Responses 

Recall from Chapter 11 that we expressed the true response a(x) of a treatment 
combination x as a linear function of interaction components E 01 as follows: 

a(x) = M + ( 12 . 12 ) 

CL 

where is the summation over all partitions a. To write down the counterpart 
to (12.12) for asymmetrical factorials, we consider again the 2 2 x 3 2 factorial. 
The partitions a are determined by the various main effects and interactions and 
are given in Table 12.8. Associated with each partition is an effect/interaction 
component and each component has a number, L, of levels as mentioned earlier. 
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Table 12.8 Partitions for the 2 2 

x 3 2 Factorial 


Interaction 

Partition 

Interaction/Modified 

Interaction 

Z* r = (Z*!, Z 21 , Z 22 ) a ' 

= (oin,a\2, ^ 21 ,^ 22 ) 

Component 

3 0 0 0 

3 0 0 0 

(A3) 6 (5 = 0, 3) 

0 3 0 0 

0 3 0 0 

(A^(5 = 0, 3) 

3 3 0 0 

3 3 0 0 

(A = 0,3) 

0 04 0 

0 04 0 

(Sf) 6 (5 = 0,4, 2) 

0004 

0 0 04 

(B*MS = 0,4,2) 

0 0 4 4 

0 0 4 4 

(SfS^(5 = 0, 4, 2) 


0 0 4 2 

(B*B^) s (S = 0, 4,2) 

3 040 

3 0 4 0 

(A 河)卜 

3 0 0 4 

3 0 0 4 

(A;) 5i - (Sf ) 知 

(5 = 0, 1,2, 3, 4, 5; 

5i = 0, 3; S 2 = 0,4,2) 

( 中 2 4 )l = (a]b 4 2 ) s - 

0 3 4 0 

0 3 40 

(A^) 5i - {B\)s 2 

(A 沖卜 (AlB 4 ^- 

0 3 0 4 

0 3 0 4 

(A 曼 ) 5l — (Sf) 52 
(A 星 S 2 4 ) 卜 (A\Bl) & - 

3 3 4 0 

3 3 4 0 

(Al) Sl - {B\)s 2 
(A\A\Bl)* = 

3 3 0 4 

3 3 0 4 

(A ⑷ 

(A^A3) 5i - (B\) Sl 

= 

3 0 4 4 

3 0 44 

(A\A\B A 2 ) & - 

(aIa 3 2 ) Si - (b% 2 

{A\B\B^ S = 


3 0 4 2 

(A>i — (SfS 2 4 ) 52 


{a\b\bI) 8 - 

(A]) 8i - 
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Table 12.8 (Continued) 


Interaction 

Z *， = (Z*p ^21 5 Z 22) 

Partition 

= (Ofn, «12, «21 ? ^ 22 ) 

Interaction/Modified 

Interaction 

Component 

0 3 44 

0 3 4 4 

= 

(A 3 2 B^B^)s- 

(A?)5, - ( 中 2 4 ) 52 


0 3 4 2 

(A|S^ 2 2 )| = 

(A 曼 

(Al) Sl - ( 中 !)5 2 

3 3 4 4 

3 3 4 4 

{A\A\B\B^ = 

(a3a|) 5i - (sfs 2 % 2 


3 3 4 2 

(A\A\B\Bl)* = 

(A\A 3 2 ) Si - (B*B^)s 2 


The levels are the elements <5 in 7(3), 7(4), R(6), respectively, as indicated in 
Table 12.8. We define the components as 

Ef = 

={average of all treatment responses a(v\\, v\ 2 , V 21 , V 22 ) 


with + a\ 2 V \2 + ^ 21^21 + 0^22^22 = 5(mod q)} 


— {average of all treatment responses} (12.13) 

where in our case q = 6. For interactions involving factors with different numbers 
of levels we now introduce what we shall call modified interaction components 
defined as 


(E a r s = wnm 
= 

-(A?H - (12.14) 


We shall refer to the partitions a involving only factors with the same number of 
levels as pure partitions and those that involve factors not all having the same 
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number of levels as mixed partitions. Let A\ denote the set of all pure partitions 
and A 2 the set of all mixed partitions. We can then write 

« ⑻ =^+ E + E ( 12 . 15 ) 

UeJ\ ，2 

where M represents the overall mean. Expression (12.15) is the extension of 
(12.12) for the asymmetrical factorial and is used in the same way as (12.12) is 
used for the symmetrical factorial. 

12.4.5 Characterization and Properties of the Parameterization 

The proposed parameterization (12.15) is a major result in that it unifies the theory 
of symmetrical and asymmetrical factorials, both with respect to the construction 
of design, that is, systems of confounding, and the analysis of such designs. 
Although we have only illustrated the idea in terms of a simple example, the 
result is true in general. The same holds for the following comments about 
further characterizations and properties of the modified components [defined in 
(12.14)] as part of the parameterization (12.15). 

1. Expressing each a(v) as (12.15) represents a reparameterization of the treat¬ 
ment effects. This reparameterization is singular since for each E a we have 

XU? = 0. 

2. For the modified components the values for 8 \ and 82 are determined uniquely 
from the addition table, for example, 


0 

3 


where the entries in the table are 8 = 8 \ 82 . 

3. For the modified components we have = 0 for each 82 and 

= 0for each^i. ' 

4. The degrees of freedom v(a) associated with the component E a are given by 

v{a) = L(E a ) — 1; for example, for we have LCA^A^B^B^)= 

6 and hence y(3, 3, 4, 2) = 5. 

5. The degrees of freedom y*(a) associated with the modified component 
(A?m 22 )* are given by 

v* (a \\a 12(X210122) = v (an a 12^21 ^22) — v(anQfi200) — y (00^21^22) 

=y(ofnQfi200) - v(00of21^22) 


02 


0 

4 

2 

0 

4 

2 

3 

1 

5 
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6 . If a block design is constructed by confounding A°^ n A ^ 12 B ^ 21 B ^ 22 and if 

A°^ n B°^ 2X B ^ 12 is a mixed component, it follows from comment 3 and 
the definition of the modified component that A°^ u A ^ 12 and B ^ 21 B ^ 22 are 
also confounded (as pointed out earlier). 

7. The component is estimated by 

= {mean of observed responses for treatments v satisfying 
a!v = 8} — {overall observed mean} 

8 . If the confounded interactions, that is, A°[ n A 0 ^ 11 B 0 ^ 11 B^ 22 , A^ 11 A^ 12 , B ? 21 
B^ 22 , are assumed to be negligible, then differences between two treatment 
responses (more generally, linear contrasts among treatment responses), say 
a(v) — a(w), can be estimated by 

a(v)-^w)= J2 [K'v - K'w] 

+ E - (E a )：, w ] (12.16) 

oceJ \，2 

where ^ refers to summation over all a except those belonging to con¬ 
founded interactions. Using (12.14), we find, for example, 

a(3040) — a(0000) = [(^)3 - (Aj)o + [(Afi4 產 )3 - (AjA^o 
+ [( 的 )4 — (5f) 0 ] + [(5f5 2 4 ) 4 - (Bf5 2 4 ) 0 " 

+ 卜 X)T — (4 吋)幻 + - (A\B A 2 )l_ 

+ 

+ [{aIaIb^x - {a\a\b\)1 

+ [{A\b\b^)\ - {A\b a x B a 2 )1 

+ [(A\B\Bl)\ - (a\b\bI% 

+ - {AW x B a 2 )1 
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+ \{A\b a x bI)1 - 

+ [(A\a\b a x B a 2 )\ - {A\A\B A x B A 2 )i\ (12.17) 


Observed means (based on different numbers of observations) are then sub¬ 
stituted for the true means to obtain 2X3040) —^(0000). 

9. It is obvious that variances for differences of the form [($)3 — (A^)o] can 
be written out easily in terms of the number of observations satisfying the 
equation 3i；n = 8 for a given 8 . If N is the total number of treatment com¬ 
binations, then the number of treatment combinations satisfying 3v\\ = 8 is 
given by N/L{A\). Hence 

var[(^)3-(A?)o]=2^a e 2 


which equals for the 2 2 x 3 2 factorial. 


For differences of the form 


(4?)3 -碗 )2 


ified components, it can be worked out that 


,that is, involving mod- 


var 


^ [l(A\B a 2 ) - L{A\) - L(5 2 4 )] 


which for the 2 2 x 3 2 factorial equals 




Finally, it can be shown that the covariances between differences of com¬ 
ponents are zero, for example, 

cov{ [(A]) 3 - ($) 0 ] ， [(^1)4 - (5^1)0] } = 0 
cov{ [(安 )3 —(安 )。] ’ [(AfBf)t - 0^)5] }= 0 
cov{[(AfBf)*- , [(AlAlB^)* - = 0 


10. Using (12.16) and the results in comment 9, one can obtain easily the vari¬ 
ances of the estimates of treatment contrasts, that is, 


var[a(i;) — ^(u;)] = var[r'(i;) — T(w)] 

=(sum of the variances of all estimated 


component differences) 
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for example, for (12.17) we find 
var[^(3040) —^(0000)]= 


2 

2 

2 

— 

+ — 

+ — 

18 

12 

12 

.1 

2 

2 V 

6 ~ 

~ 18 " 

12 人 


20 2 
18 ^ 


11. The sums of squares for unconfounded effects can be written out similarly 
to those for symmetrical factorials: 


SS(A^) 
SS(Sffif) 
SS(A ⑽ 


N 


UAi) 


E [d: 


8el(3) 


N 


E [ 碗 )5 


sem 


8eR(6) 


and because of comment 9 these sums of squares are orthogonal to each 
other. This makes testing of hypotheses concerning main effects and inter¬ 
actions easy. 


12.4.6 Other Methods for Constructing Systems of Confounding 

In addition to the method discussed in this section, alternative but essentially 
equivalent methods of constructing systems of confounding for asymmetrical 
factorials have been proposed. A method based on finite groups and rings is 
discussed by Banerjee (1970), Worthley and Banerjee (1974) (see also Raktoe, 
Rayner, and Chalton, 1978), and Sihota and Banerjee (1981), who extended the 
method to include any number of levels, thus avoiding the use of pseudofactors. 
A different technique based on the Chinese remainder theorem was introduced 
by Lin (1986) and extended by Huang (1989). 


12.5 BALANCED FACTORIAL DESIGNS (BFD) 

The incomplete block designs for asymmetrical factorials we have discussed so 
far have resulted in the confounding of certain contrasts belonging to some main 
effects and/or interactions. As a consequence the number of degrees of freedom 
for such effects has been reduced (possibly to zero). The lost degrees of freedom 
are said to be confounded with blocks. As we have seen in the previous sections, 
the confounding of interactions may sometimes also lead to the confounding of 
main effects, for example, with the methods of Sections 12.2 and 12.4. If we 
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confound, for example, a two-factor interaction involving factors with different 
numbers of levels, then the corresponding main effects are confounded also. 
This is generally an undesirable feature. If it cannot be avoided, the method of 
partial confounding, that is, using different systems of confounding in different 
replicates, provides some relief. This, however, may not always be feasible from a 
purely practical point of view because of the usually large number of experimental 
units required. 

In this section we shall discuss a method that results in a different kind of 
partial confounding. Rather than confound completely certain degrees of freedom 
in some replicates and not at all in others, the method to be described results in 
some loss of information for some degrees of freedom over the whole design. 
We shall illustrate this with a very simple example. 

Example 12.5 Consider the 2 2 x 3 factorial in blocks of size 6. Using the 
Kronecker product method of Section 12.2, a reasonable design may be obtained 
by using three replicates of two blocks each. If we denote the factors by Ai, A 2 , 
and B with two, two, and three levels, respectively, we may construct the design 
by confounding A\ in replicate I, A 2 in replicate II, and Ai A 2 in replicate III. The 
allocation of the treatment combinations is given in Table 12.9a. Compared to a 
RCBD with b = 3 blocks (replicates), the relative information for main effects 
A\, A 2 and the interaction A\A^ is | and 1 for all other effects. 

In Table 12.9b an arrangement due to Yates (1937b) is given that is the only 
arrangement not resulting in the confounding of main effects. Kempthome (1952) 


Table 12.9 2 2 x 3 Factorial in Blocks of Size 6 


Replicate 

I 


II 


III 


Block 1 

2 

3 

4 

5 

6 


a. 

Kronecker Product Method 



000 

100 

000 

010 

000 

100 

001 

101 

001 

Oil 

001 

101 

002 

102 

002 

012 

002 

102 

010 

110 

100 

110 

110 

010 

011 

111 

101 

111 

111 

Oil 

012 

112 

102 

112 

112 

012 



b. Yates Method 




000 

100 

100 

000 

100 

000 

110 

010 

010 

110 

010 

110 

101 

001 

001 

101 

101 

001 

011 

111 

111 

011 

Oil 

111 

102 

002 

102 

002 

002 

102 

012 

112 

012 

112 

112 

012 
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shows, from first principles, using the least-squares method, that the relative 
information for A\A^ is for A\A^B is and for all other effects is 1 . 

If we define the total relative loss in information ， TRL say, as the weighted 
sum of relative losses for the various effects, with the weights being the degrees 
of freedom for each effect, we find that TRL is the same for both arrangements, 
namely 

TRL(a) = l- i + l- i + l- | = l 
TRL(b) = 1 ■ ^+2- I = 1 

Even so, we would generally prefer arrangement (b) over arrangement (a) since 
its loss of information is mainly associated with the three-factor interaction. □ 

Both designs of Example 12.5 belong to the class of balanced factorial designs 
(BFD), a notion defined by Shah (1958) for the general factorial experiment. 
Although Shah (1958, 1960) pointed out the connection between BFDs and PBIB 
designs, Kurkjian and Zelen (1963) and Kshirsagar (1966) established the one- 
to-one correspondence between BFDs and EGD-PBIB designs as defined by 
Hinkelmann and Kempthorne (1963) and Hinkelmann (1964) (see Section 4.6.9). 
We shall now give some definitions of and results for BFDs and then give some 
methods of construction. 

12.5.1 Definitions and Properties of BFDs 

Suppose we have n factors A\, A 2 , .. with s\, S 2 , …， s n levels, respectively, 
where the Si (i = 1,2,, n) do not necessarily have to be all distinct. Following 
Kurkjian and Zelen (1963) and extending the notation of Section 11.9, we define 
the Si x Si matrix 

Mi ^Sil - 00' (12.18) 

and 

f y if z/ = 0 
[Mi if z/ = 1 

M z = M\ l x M z 2 2 x ••• x M z n n (12.19) 

for z = (zi ， Z 2 , • • • ， Zn) and n = 0, 1 (/ = 1,2,..., n). Let Z = {z} be the col¬ 
lection of all possible vectors z except z = (0, 0, .. . ， 0). Further, let t = Yl^i s i 
denote the number of treatment combinations. Then for any z G Z the interaction 
components in the n-factor factorial can be expressed in terms of the treatment 
effects r(x\, X 2 , .. x w )(0 < xi < Si — 1 ; / = 1 , 2 , .. n) arranged in standard 
order (see Section 11.9) in the vector r as 

a z = -M z r (12.20) 

t 
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Let 

0( z ) = ^ = ^( z )r (12.21) 

denote a normalized contrast belonging to a z (note that we are using a z for two 
different things but it should always be clear from the context what is meant). 
With 

r = CQ 

as a solution to the reduced normal equations we obtain the BLUE of 0(z) in 
(12.21) as 

0(Z) = l f z a z = l -H! z M z x (12.22) 

Following Gupta and Mukerjee (1989)，we then give the following definition. 

Definition 12.1 In a factorial design an interaction a z (z G Z) is said to be 
balanced if either (a) all contrasts belonging to a z are estimable and the BLUEs 
of all normalized contrasts belonging to a z have the same variance or (b) no 
contrast belonging to a z is estimable. □ 

If we adopt the convention that if a contrast is nonestimable its BLUE has 
variance oo, then parts (a) and (b) in Definition 12.1 can be combined, and it is 
in this sense that we shall use the definition. For an overall characterization of a 
factorial design, we then have the following definition. 

Definition 12.2 A factorial design is said to be a BFD if the interaction a z 
is balanced for every z G Z. 

It is of historical interest to note that Li (1944, p. 457) already refers to BFD 
(although not by that name) when he says, “An interaction which has more than 
1 degree of freedom should be confounded as evenly as possible.” □ 

Gupta and Mukerjee (1989) show that an interaction a z is balanced in the 
sense of Definition 12.1 if and only if the BLUEs of any two orthogonal con¬ 
trasts belonging io a z are uncorrelated. Recall that a similar result holds for 
factorial designs with OFS except that in that case BLUEs of contrasts belonging 
to different interactions are uncorrelated. Hence in a BFD with OFS we have 
between-interaction and within-interaction orthogonality (Gupta and Mukerjee, 
1989). As we have pointed out earlier (e.g., 1.7), orthogonality is a useful prop¬ 
erty as it leads to ease of analysis and ease of interpretation of the results from 
an experiment. We have seen in Chapter 11 that OFS requires a certain structure 
of C _ (see Theorem 11.12) and hence of C and ultimately NN f . The results of 
Shah (1958, 1960), Kurkjian and Zelen (1963), Kshirsagar (1966), and Gupta 
and Mukerjee (1989) have led to similar structures for NN’ (and hence C and 
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C_) for BFDs. We shall give a brief description of the major results, deferring 
details to the above references. 

The basic result for giving an algebraic characterization of a BFD with OFS 

is what Kurkjian and Zelen (1963) have called property A of the matrix NN f , 

<5. 

Let Dj 1 be an Si x Si matrix defined as 


Df 




if 8 i = 0 
if 8 i = 1 


for i = 1, 2 ,..., n, and 




with 5 = (5i, ^ 2 ,..., 8 n ). Denote the set of all S (with 5 / = 0, 1) by Z*= Z 
U{(0, 0,..., 0)}. We then state formally: 


Definition 12.3 (Gupta and Mukerjee, 1989) A t x t matrix G with t = 
YYi=i s i i s said to have property A if it is of the form 

G=J2 h(8)D s (12.23) 

SeZ* 

where the h(8) are constants depending on 8 . 

Now suppose the matrix C has property A, that is, 

C = rl - -NN' = ^ h(8)D s (12.24) 

k 


We want to show that if C is of the form (12.24) then all normalized contrasts 
belonging to a z are estimated with the same variance and the estimators of 


any two orthogonal contrasts belonging to a z are uncorrelated. Let be an 
(si — 1 ) x Si matrix of normalized orthogonal contrasts, that is, //p Si =0and 



H\ if zi = 1 
1 if zi = 0 


and 


H z = \\ xHf 
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A complete set of normalized orthogonal contrasts belonging to a z is then given 
by H z a z . More specifically, the contrasts belonging to the /?-factor interaction 
A/j x A/ 2 x • • • x Ai p (l < p < n), that is, to the Si'Sf Si p elements of a ?1 0 
cii 2 <S> … <S> a ip ， are given by 

(H' h x x ■.. x H’ ip ) (a h 0 a i2 ③.■•⑧ a ip ) 

and there are (si l — l)(si 2 — 1)... (Si p — 1) such contrasts. We can then prove 
the following theorem. □ 


Theorem 12.1 If for an s\ x S 2 'x • • • 'x s n factorial design the C matrix 
satisfies property A, then the design is a BFD and the variance-covariance 
matrix for H z ^ is given by 


v^r(H z a z )= —— : - lot 


(12.25) 


where r is the number of replications for each treatment combination and E(z), 
the efficiency factor for a z , is a constant depending only on z. 


Proof Let 


Then 


Now 

and 


C= h( 8 )D s 


8€Z* 


M Z C = h( 8 )M z D 8 


8eZ* 


^2 h(8)Y\^<M z i i Df 


8eZ* 


Mf D 》=(1 - Zi 8 i)s^~ Zi)Si 


~[ xMfDj =0 


so that (12.26) can be written as 


M Z C= ⑽) n x L (1 - zA)sf~ z)Si M] 

8eZ** i 

=rE(z)M z 


(12.26) 


(12.27) 
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where, by definition, 


rE{z)= Y, ⑽ ) ] 1 (1 -认 

SeZ** i 

and Z** = Z* - {(1, 1， .. • ， 1)}. It follows now from 

Cr= Q 


(l-zM 


and (12.27) that 


can be written as 


and, using (12.20), 


so that 


Then 


M z Cr = M z Q 


rE(z)M z r = M Z Q 


rtE^z^a 7, = M z Q 


HZ ^ z = vik) HZMZQ 


\ax(H z a z ) 


and using (12.27) again 


[rtE(z)] 2 


H z M z C{M z )'(H z )'a^ 


war(H z a z ) = — H z M z (M z )'{H z )' 


t 2 rE(z) 


To simplify (12.29), we write 


H Z M Z = (H\ l x x ... x H z n n ) {M\ l x Aff x ... x M z n n ) 
=x (H z 2 2 M z 2 2 ) x ... x (H z n n M z n n ) 

From the definitions of Hf and Mf it follows that 


H Z ； M\ 


\^ Si ^zi=0 

[siH^ if z,- = 1 


(12.28) 


(12.29) 
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and 


H z M z (M z Y(H z y = (M z H z )(M z H z y = ]~[ s)~ Zi ]~[[5f ] 2 / (12.30) 

i i 

Substituting (12.30) into (12.29) yields 

var(H z a z ) = — FT s] +Zi I^ 

t 2 rE(z) 11 ; e 

l 

trE(z) 11 1 e 


i 




Y\isl~ Zi rE(z) 

using the fact that t =「[/ &•. This is the desired result. 


□ 


We comment briefly on the efficiency factors E(z) = (zi ， Z 2 , ...， z n ) associ¬ 
ated with the interaction a z . Recall that the efficiency factor of an IBD is defined 
as the ratio of the average variance for simple treatment comparisons using an 
RCBD over the corresponding variance for the IBD. We saw in Chapter 4 that if 
the IBD is a PBIB(m) design then we may have up to m different efficiencies E\, 
五 2 , .. . ， E m ，where Ei refers to the efficiency factor for comparing two treatments 
that are ith associates (/ = 1, 2, ..m). In the context of the BFD we have up 
to 2 n — 1 different efficiency factors E (z) associated with treatment comparisons 
belonging to the various interactions (which in our terminology also include the 
main effects). Let i f z t a z {i = 1,2, .. •， y(z)) be a set of linearly independent nor¬ 
malized contrasts belonging to a z [e.g.，the rows of H z constitute such a set of 
i r z t , where v(z) represents the number of degrees of freedom for a z ]. We know 
that with an RCBD each such contrast is estimated in terms of treatment means, 
averaging over the levels of those factors not involved in the interaction, that is, 
the factors for which zt = 0 in a z . With r replications (blocks) we then have 


var RCB D( 《 ,〆）= 



From (12.25) we have 


\axmD(l' zi a z )= 


rUi^Eiz)^ 


(12.31) 


(12.32) 


The ratio of (12.31) to (12.32) is then equal to E(z). We shall return to evaluating 
E(z) shortly. 
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We have shown in Theorem 12.1 that if for a factorial design the C matrix has 
a certain structure, as defined by property A, then the design is a BFD. Kshirsagar 
(1966) proved the converse so that we can now state the following theorem. 

Theorem 12.2 For an 5 i x ^2 x • • • x ^ factorial design to be a BFD with 
OFS, it is necessary and sufficient that the C matrix of the design satisfies 
property A. 


We proved the sufficiency part in Theorem 12.1 and for the necessity part we 
refer the reader to Kshirsagar (1966) (see also Gupta and Mukerjee, 1989). 

12.5.2 EGD-PBIBs and BFDs 

To provide further insight into BFDs, we recall that the association matrices B Y 
for an EGD-PBIB design can be expressed as Kronecker products of (the trivial) 
association matrices for a BIBD, Bq = I and B\ = 33 f — I (see Section 5.4). 
More specifically, using the notation of this section, we define 

W ，、卜 for y, = o 

- I Si for Yi = 1 

Then the association matrix, B y , for yth associates [with y = 72 ， ...， Yn) 

and yi = 0, 1 for / = 1, 2, .. n] can be written as 

B y = x B^ 2 x- • x B y n n 

with 

NN，= E x ^ Br 

yeZ* 

or 

C =J2 c(y)By (12.33) 

y^Z* 

where 


k- 1 

c (0, 0, … ， 0)= 入(0, 0, ... ， 0)—— 

k 

c(y) = -yHy) for all y ^ ( 0 , 0 ,..., 0 ) 
k 

It is not difficult to speculate that there exists some relationship between BFDs 
and EGD-PBIB designs. Indeed, Kshirsagar (1966) proved the following (for an 
alternative proof see Gupta and Mukerjee, 1989). 

Theorem 12.3 Every n-factor BFD is an EGD-PBIB {2 n — \) design. 

This result is useful for several reasons, two of which we shall discuss in some 
detail: 
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入 (001) = 1 
入 (101) = 2 
入 (Oil) = 2 
Mill) = 1 


Further, we can express each B Y easily in terms of the Df as used in defining 
property A [see (12.23)]. We note that 

= B 2 = I 2 = = D 2 

B\=B\ = 32^2 - Ii = D\ - D\ = D\ - D° 2 

and 

= I 3 = ^ 3 , ^3 = 33^3 — = D\ — 

so that, for example, 

j 5 (ooo) = D° 2 x 

B( 100 ) = D\ x D° 2 x D^- x D^x D° 3 
b (iio) = d\ x D\x D^- D\ x D^x D° 3 
-x D 1 2 x D° 3 +x D^x 

and so on. Collecting terms, we can rewrite (12.33) as 

c = (3 _ 8) 2)(000) _ 4 D (100) _ 40(010)-2^(110) 

+ 12)(001) _ ^ D (101) _ 去 £)(011) _ ^ D (111) 

that is, in the form (12.23) with 

"(000) = 誓 ft(100) = -\ "(010) = —I "(110) = —I 

"(001) = 1 ft(101) = -i h(pn) = -h h{\U) = -\ 


i. For any given EGD-PBIB it is easy to obtain the efficiency factors; 

ii. Existing methods for constructing EGD-PBIBs can be used to obtain BFDs. 

Let us consider first i by returning to Example 12.5. 

Example 12.5 (Continued) For arrangement b in Table 12.9 it is easy to 
verify, using the association scheme for EGD-PBIBs, that 


3 0 0 3 


\17 \—/ \ly 

o o o o 

C-—-3 C33 1a 1i 

c — 3 1i cu IX 

/IV /V /V /V 

入入入入 
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Using (12.28), we can now obtain the efficiency factors E(z). Let us write 


with 




o 


for n = 0, 
for Zi = 1, 
for Zi — 0, 
for Zi = 1, 


< 5 / = 0 

8i=0 

Si = 1 
5 , = 1 


Then (12.28) can be rewritten as 


n 

r£"(zi ， Z2 , …，〜) =^2 "(zi, （ 12.34) 

8eZ** i=\ 


So, for example, 


Hence 


6 • 3E(U0) = 10^1(1,0) W 2 (1,0) W 3 (0, 0) 
-4Wi(l, 1) W 2 (l,0) W 3 (0, 0) 
-4Wi(1,0) W 2 (l, 1) W 3 (0, 0) 
1 ) W 2 (h 1) W 3 (0, 0) 
+2Wi(l,0) W 2 (l,0) W 3 (0, 1) 
-Wi(l,l) W 2 (l,0) W 3 (0, 1) 
-Wi(l,0) W 2 (hl) W 3 (0, 1) 
=10 + 2.3= 16 

£( 110 ) = | 


the same value we obtained earlier and referred to as relative information for the 
two-factor interaction A\ x A 2 . 

If we define the efficiency factor for a BFD as 


^=En 

zeZ i 


(Si- l) Zi E(z) 


TRL 



(12.35) 


with TRL being the total relative loss defined earlier, we obtain, for both designs 
of Table 12.9, E = |j. To choose among competing designs, it is therefore in 
general advisable to consider not only E for each design but also the individual 
E(z) and then choose the design that is most appropriate for the experiment 
under consideration. □ 
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12.5.3 Construction of BFDs 

We shall now turn to the construction of BFDs or, alternatively, to the con¬ 
struction of EGD-PBIB designs. For some of these methods we shall refer 
to Chapter 5 and provide here just an illustration, for other methods we shall 
give a brief description of their special features in the context of factorial 
experiments. 

12.5.3.1 Kronecker Product Designs 

The method is described in Section 5.5, and we know from Chapter 4 that the 
Kronecker product of BIBDs is an EGD-PBIB design. As an illustration let us 
consider Example 5.5 to construct a BFD for a 4 x 3 factorial in blocks of size 6 . 
Using the SDP notation of Section 11.9, the treatment combinations are written 
in the order (0, 0), (0, 1), (0, 2), (1, 0), (1, 1), (1, 2), (2, 0), (2, 1), (2, 2), (3, 
0), (3, 1), (3, 2). The 12 blocks are then as given in Table 12.10. Following our 
earlier discussion, it is easy to work out that E(l, 0)= 兹，五 (0, 1)= 髮 ， and 
£"(1,1) = II， and the overall efficiency factor is E = l — 

12.5.3.2 GD-PBIB(2) Designs 

If in an EGD(3)_PBIB design we have 入 （ 10)= 入 (11), then the design reduces 
to a GD-PBIB(2) design with parameters n\ = n(0l), = n(l0) +"(11) and 

= 入 (01) ， 入 2 = 入 （ 10)= 入 （ 11). It is therefore possible to obtain a BFD for a 
t = s\ x S 2 factorial if there exists a GD-PBIB design for t = s\S 2 treatments. 


Table 12.10 4x3 Factorial in Blocks of 
Size 6 Using a Kronecker Product Design 


1 

2 

Block 

3 4 

5 

6 

00 

00 

01 

00 

00 

01 

01 

02 

02 

01 

02 

02 

10 

10 

11 

10 

10 

11 

11 

12 

12 

11 

12 

12 

20 

20 

21 

30 

30 

31 

21 

22 

22 

31 

32 

32 



Block 



7 

8 

9 

10 

11 

12 
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All we need to do is to arrange the factorial treatment combinations in the rect¬ 
angular array 


(0,0) (0，1) … (0, - 1) 

(1,0) (1,1) … (1,5 2 -1) 


(U — 1 ， 0) (s\ — 1, 1) 


(^1 ~l,s 2 - 1 ) 


and use the association scheme for a GD-PBIB design: Any two treatments in the 
same row are first associates and any two treatments in different rows are second 
associates. We can then use any suitable GD-PBIB design to obtain a BFD. This 
method was proposed by Kramer and Bradley (1957) and Zelen (1958). We shall 
illustrate this method for the 4x3 factorial in blocks of size 6 using design S 
27 of Clatworthy (1973). 

Using the following correspondence between ordinary treatments and factorial 
treatments, 


1 

5 

9 

00 

01 

02 

2 

6 

10 — 

10 

11 

12 

3 

7 

11 

20 

21 

22 

4 

8 

12 

30 

31 

32 


the BFD is given in Table 12.11 [we note here that this labeling does not agree 
with that obtained from the SDP of (0 1 2 3)’ and (0 1 2)’ but rather it agrees 
with the labelling for GD-PBIB designs as given by Clatworthy (1973) and that 
for EGD-PBIB designs]. With 入 0 = r = 3, = 3, 入 2 = 1 for the GD-PBIB and 
hence 入 (00) = r = 3, 入 (01) = 3, 入 (10)= 入 (11) = 1 for the EGD-PBIB we can 
obtain the efficiency factors in the usual way except for one small change. From 


Table 12.11 4x3 Factorial in Blocks of 
Size 6 Using a GD-PBIB 


Block 
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the structure of the GD-PBIB design we know that we can write 


Bo = It 

忍 1 = (^52^52 ~ x ^ SI 
= 0 S2 3 f S2 X I Sl - It 

= ^52^52 X (llXl _ ’ 5 1) 

= 0 t 0 f t -0 S2 0 f S2 xI Sl 
and 

B 0 = B m = D^x D° { 

B { = B m = D\x D° { -D^x D° { 

B 2 = B (l0) + B (n) = D\x D\- D\x D° { 


Notice that because of the labeling the order of the D matrices is reversed (this 
is, of course, essential only for D\ x D?). Hence, in general, 

C=(r - DUD 0 ,- xD° { - fD\xD\ 

so that, using (12.34) and the properties of the Wi, 

rE(l, 0) = (r - WKl, 0)W 2 (0,0) 

- 入 1 7 入 2 Wi(i’o)w 2 (o, 1) 


and 


rE(0, 1)= 




叭 (0, 0)W 2 (1,0) 


rE(l,l) = 



W 2 (l, 0)^1 (1,0) 


Then, for the specific example of Table 12.11 we find 

£(1,0) = f E(0, 1) = 1 £(1,1)= 1 

that is, we have full information on the main effect A 2 and the interaction A\ x 
A 2 and two-thirds information on the main effect A\. 

The GD-PBIB design for t = t\t 2 treatments can, of course, also be used for 
factorials with more than two factors, that is, for an x 乃 x •. • x & factorial, 
as long as Si { Si 2 - - - Si m = t\ for any m factors (m < n) and sj x sj 2 - - - sj e = t 2 for 
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the remaining I = n — m factors. Thus the design in Table 12.11 can also be 
used for the 2 x 2 x 3 factorial in blocks of size 6 with t\ = 4 = S 1 S 2 = 2-2. 
All we need to do is relabel appropriately the levels of A\. It follows then 
immediately that 


£( 100 ) = £( 010 ) = £( 110 ) = § 


and all other E(z) = 1. 


12.5.3.3 Other Methods 

The methods of constructing EGD-PBIB designs given by Aggarwal (1974) and 
Chang and Hinkelmann (1987) (see Section 5.5) can be used to obtain different 
systems of confounding for the x ^2 x • • • x ^ factorial. If the factors are 
labeled such that < 52 < • • • < then Aggarwal’s method leads to BFDs 
with block size k = s\. The method by Chang and Hinkelmann is somewhat 
more flexible in that it can be used to obtain BFDs with block sizes k = Si 
(/ = 1, 2, ..w — 1). It leads, however, to disconnected designs, which means 
that some effects are completely confounded with blocks. Both methods can 
be modified easily to generate a larger class of BFDs by combining several 
factors into “superfactors” and thereby changing the order of the levels, that is, 
some si ^ s\ becomes the smallest number of levels, or creating new levels, 
that is, s n may no longer be the largest number of levels. The first modification 
is useful for the Aggarwal method, the second for the Chang and Hinkelmann 
method. 

There exist other ad hoc methods used by Yates (1937b) and Li (1944) for 
specific asymmetrical factorials. For example, the design for the 2 2 x 3 factorial 
given in Table 12.9b can be obtained by combining the levels of factor A 3 with 
treatment combinations for factors A\ and A 2 determined by the equations 


and 


^+^2 = 0 : ( 0 , 0 ), ( 1 , 1 ) 
X!+x 2 = I ： ( 1 , 0 ), ( 0 , 1 ) 


If we denote {(0, 0), (1, 1)} by a and {(1 ， 0) ，（ 0, 1)} by 卢 ， then the design can 
be written in compact form as follows (Kempthome, 1952): 


Block 


1 

2 

3 

4 

5 

6 

a0 

po 

po 

a0 

po 



al 

Oil 



al 

P2 

a2 

P2 

a2 

a2 
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Rather than describe the ad hoc methods in detail, we reproduce some of the more 
useful designs given by Li (1944) for up to four factors with two, three, and four 
levels. They are given in Appendix D together with some relevant parameters. 

Other methods of construction have been proposed by Kishen and Srivastava 
(1959) and Muller (1966). They consider s x s\ x • • - x s n -\ factorials, where s 
is a prime or prime power and various block sizes. 

It is quite obvious that BFDs require in general several replications that may 
lead to more blocks than is feasible from a practical point of view. If one is willing 
to give up the property of balancedness, then useful designs can be obtained by a 
suitable reduction of the number of replications. Such possibilities are mentioned 
for some situations in Appendix D. Even though these designs are no longer 
BFDs, they yield full or partial information on all degrees of freedom. This may 
make these designs more useful than comparable designs obtained by methods 
discussed in Sections 11.2 and 11.3. 



CHAPTER 13 


Fractional Factorial Designs 


13.1 INTRODUCTION 

So far, we have considered possible arrangements of the complete factorial sys¬ 
tem; that is, we have considered only arrangements in which each individual 
treatment combination of the factorial set is represented the same number of 
times. For example, with 4 factors each at 2 levels, we have discussed arrange¬ 
ments of the 16 treatment combinations in blocks of size 2, blocks of size 4, 
8x8 squares, and so on, but always with the restriction that each of the 16 
combinations is to be tested an equal number of times. Such a restriction is 
practicable with a small number of factors, but, when we consider the case of, 
say, 10 factors each at 2 levels, it would result in the necessity of testing 1024 
combinations or a multiple of these. The main reason for imposing the restric¬ 
tion is that it results in the estimates of effects and interactions having maximum 
precision and being uncorrelated. Thus, to take a simple example, suppose we 
are evaluating 2 factors, A and B, each at 2 levels, and test the treatment com¬ 
bination (1), a, b, ab with r\, ri, n, r\ replications, respectively. Then using 
the definitions of Section 7.2.1, each effect and interaction is estimated with 
variance 


a] ( 1 1 1 1 \ 

4 \r\ n n r\) 

but the estimates are correlated thus: 


4 

cov (A, AB) = 



「 1 

l 

l 11 

— 

. — 

- 1 — 

Ln 

n 

厂 3 厂 4 」 

「 l 

l 

1 1 

— 

— 

H -1 -一 


r\ 厂 2 厂 3 r\ 
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and 


COY 


(b,ab) 


1 1 1 1 " 

.n r 2 r 3 厂 4 _ 


These covariances will be zero if r\ = r 2 = = r\. Furthermore, if we fix ( 厂 i + 

厂 2 + 厂 3 + 厂 4 )，the total number of observations, we obtain minimum variance for 
each of the effects and interaction with n = r 2 = = r\. 

As we have seen throughout, the general problem of the design of experiments 
is that we are given a model and we wish to determine parametric functions of 
such a model with as low a variance as possible. There are two methods by which 
this may be accomplished: namely, by choosing the pattern of observations, for 
example, equal r’s above, or by reducing the error variance by choice of the 
error control design. 

The question we ask in this chapter is whether it is always necessary to test 
all the factorial combinations equally frequently or whether we can omit some 
of them. The question is of considerable relevance, for factorial experiments are 
most appropriate for exploratory research, and in such research the number of 
possible factors that should be tested is by no means so small as 2 or 3. For 
example, in research into the possible importance of the various vitamins in the 
nutrition of an organism, the number of factors that might be used is at least of 
the order of 10. It is, furthermore, important to use a factorial system because 
we cannot assume all interactions to be negligible. The testing of 1024 treatment 
combinations may be virtually impossible from the practical viewpoint. If the 
whole 1024 combinations were tested, the subdivision of the 1023 treatment 
comparisons would be 


Main effects 

10 

2 -factor interactions 

45 

3-factor interactions 

120 

4-factor interactions 

210 

5-factor interactions 

252 

6 -factor interactions 

210 

7-factor interactions 

120 

8 -factor interactions 

45 

9-factor interactions 

10 

10 -factor interactions 

1 

Total 

1023 


It may well be reasonable to assume that high-order interactions are negligible, 
and, if, for instance, all interactions involving 3 or more factors could be ignored, 
the testing of the 1024 combinations would give an estimate of error based on 
about 950 degrees of freedom. The accuracy that would result for the estimation 
of main effects and interactions (a variance of /256 where is the experi¬ 
mental error variance) might well be unnecessarily high. We may then ask what 
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information can be obtained from the testing of a lesser number of combinations. 
We shall, in fact, find that information on main effects and interactions for all 
the 10 factors can be obtained under some mild assumptions from the testing of 
512, 256, or perhaps even 128 of the total of 1024 combinations. The general 
process by which this is accomplished is known as fractional replication. The 
resulting design is called a fractional factorial design, which was introduced first 
by Finney (1945). 

For an annotated bibliography of applications papers using some of the con¬ 
cepts developed in the following chapters, see Prvan and Street (2002). 


13.2 SIMPLE EXAMPLE OF FRACTIONAL REPLICATION 

We shall illustrate the concept of a fractional factorial design in terms of a simple, 
though somewhat unrealistic, example. Suppose we are testing three factors A, B, 
and C, each of two levels, and for some reason we are able to observe only four of 
the eight possible treatment combinations. Let these four treatment combinations 
be a, b, c, and aloe. The question then is: What information can we obtain from 
these four combinations? This question can be answered easily by referring to Table 
13.1，which shows the relationship between the true yields of these four treatment 
combinations and the effects and interactions for the ordinary 2 3 design. It shows 
that, for these four treatment combinations, M and jABC have always the same 
sign, and so have and \BC, and \AC, 士 C 1 ，and \AB. It will therefore be 
impossible to separate the mean from the ABC interaction, the A effect from the 
BC interaction, the B effect from the AC interaction, and the C effect from the AB 
interaction. In fact, we have the following four estimating relations: 

^ (a + Z? + c + a be) estimates M + ^ABC 

— b — c -\- abc) estimates A + BC 

\{—a b — c abc) estimates B + AB 

— b c abc) estimates C + AB 

where a, b, c, abc are now the observed yields. We may say that, with these 
four observations only, A is completely confounded with BC because there is 


Table 13.1 Relationship Between True Yields and Effects and 
Interactions for 2 3 Design 

True Yield M \A \B \AB \C \AC \BC \ABC 
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no possibility of estimating A alone and BC alone but only their sum. We also 
express this by saying that A is an alias of BC (and vice versa) or that A is 
aliased with BC. Similarly, B is completely confounded with AC, C with AB, 
and M with ABC • This confounding, however, need not cause the experimenter 
any worry if he is prepared to assume that the interactions are negligible: Then 
the second, third, and fourth comparisons above may be used to estimate the 
three main effects, and the estimates are uncorrelated. In this simple example, 
there is not, of course, any possibility of estimating error variance, and we would 
not, generally, use this design for any practical purpose. It serves, however, to 
bring out the main idea of fractional replication, namely that a suitably chosen 
subset of the full factorial set can provide worthwhile information. The above 
example utilizes a ^ replication of the 2 3 system because we have tested only 
four of the eight combinations. 

We could equally well have used the treatment combinations (1 )， ab, ac, and 
be, and it may be verified that the estimating equations will be 

M- \ABC = i[(l) -\-ab-\-ac + be] 

A — BC = +[—(1) -\- ab -\- ac — be] 

B _ AC = ^ [ — (1) ~h cib _ uc H - 

C — AB = ^[—(1) — ab ac — be] 

The confounding among the mean, main effects, and interactions is the same as 
before, only the functions that can be estimated are different. 

The dominant feature of this example is the choice of the interaction ABC 
and the selection of either the treatments entering positively in this interaction or 
those entering negatively, or alternatively, either those that satisfy the equation 

xi X 2 X 3 = 1 mod 2 
or those satisfying the equation 

x\ X 2 X 3 = 0 mod 2, respectively 

For most purposes we may disregard which half of the treatment combinations 
we choose because we shall always choose an interaction in such a way that 
those interactions that are confounded with the effects or interactions we wish to 
estimate are assumed on the basis of prior knowledge to be negligible. 

If we use the equality sign to denote “confounded with,” we may write the 
confounding relations above in the form: 

A = BC 
B = AC 


C =AB 


(13.1) 
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and, if by convention we denote M by /, also 

1 =ABC (13.2) 

We note the equalities (13.1) are obtained from (13.2) by regarding the relation¬ 
ship (13.2) as an algebraic identity and I as unity and using multiplication with 
the rule that the square of any letter is to be replaced by unity. Thus, (13.2) 
multiplied by A gives 

A = A 2 BC = BC 

when multiplied by B gives 

B =AB 2 C =AC 
and when multiplied by C gives 

C = ABC 2 = AB 


In Table 13.2 we show how this design and the alias structure can be obtained 
using SAS PROC FACTEX. We comment briefly on the input statements and 
the output: 

1. The number of treatment combinations to be used can be specified either 
by “size design = # of treatment combinations” or “size fraction = denom¬ 
inator of fraction.” 

2. The model can either be specified by listing the effects we wish to estimate 
or by giving the resolution of the design (see Section 13.3.2). 

3. The factor confounding rule is equivalent to the defining relation in the 
sense that C = A * 5 is formally identical to C 2 = A * B * C and C 2 = /; 
operationally C = B indicates how from a complete 2 2 factorial the 
levels of the factor C are obtained, namely by multiplying the levels of 
factors A and B into each other when the levels are denoted by — 1 and +1. 


If, instead of a, b, c, abc, we had chosen the treatment combinations that enter 
positively into AB, namely (1), ab ， c, and abc, we can obtain the confounding 
relationships formally by multiplying each effect or interaction into 


I = AB 

Thus 

A = B = B 
C = ABC 
AC = A 2 BC = BC 


(13.3) 


(13.4) 
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Table 13.2 1/2 Fraction of 2 3 Factorial With Defining Contrast I = ABC 

options nodate pageno=l ； 
proc factex ； 
factors ABC; 
size design=4 ； 
model est=(A B C )； 

examine design aliasing confounding ； 
titlel 'TABLE 13.2'; 

title2 '1/2 FRACTION OF 2**3 FACTORIAL 1 ; 
title3 'WITH DEFINING CONTRAST I^ABC 1 ; 
run ； 

proc factex ； 
factors ABC; 
size fraction=2 ； 
model res=3 ； 

examine design aliasing confounding ； 
run ； 


The FACTEX Procedure 
Design Points 

Experiment 

Number ABC 


1 -1 -1 
2-11 

3 1-1 

4 11 

Factor Confounding Rules 
C = A*B 



Aliasing Structure 

A = B*C 
B = A*C 
C = A*B 


The equations (13.4) we interpret then by saying that A and B are confounded, 
C and ABC are confounded, and AC and BC are confounded. Of course, (13.3) 
means that M and AB are confounded. Alternatively, we can say that with the 
four observations above we can only estimate M + \AB, A B, C ABC, and 
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AC + BC. This can be verified easily by using an argument similar to the one 
we used at the beginning of this section. 

In each of these two examples we have used a \ replicate of the full set 
of factorial combinations. The information obtained from these two designs is 
quite different: If we can assume that the three factors do not interact, the first 
replicate is a satisfactory design (apart from the fact that no estimate of error can 
be made) because we can estimate each of the three main effects A, B, and C. 
In the second j replicate, however, the effect A is confounded with the effect 
B ， and this is generally undesirable. 

The crucial part of the specification of the design is clearly the choice of the 
relationship I = ABC or / = AB. This relation is known as the defining rela¬ 
tion or identity relationship. Once this relationship is chosen, we can specify the 
functions of the parameters that can be estimated, and the choice of an identity 
relationship is based on this fact. Thus, we would typically not use a relationship 
that results in confounding of main effects with each other. 

In passing we note that the defining relation can be written in such a way that 
it identifies uniquely the two complimentary ^ replicates, for example, 

I = +A5C, and I = - ABC 

the first denoting the ^ replicate based on the treatment combinations entering 
positively into ABC ， and the second based on those entering negatively into 
ABC. The specific form of the defining relation also tells us which functions 
of the parameters can be estimated; for example, A + BC from the first and 
A — BC from the second ^ replicate. In either case, A and BC are confounded, 
and so forth. The totality of all confounding relationships is also referred to as 
the alias structure of the fractional factorial design. 


13.3 FRACTIONAL REPLICATES FOR 2 n FACTORIAL DESIGNS 

It is clear from the discussion in Section 13.2 that for a j replicate of a 2 n factorial 
design will generally be determined by the identity relationship of the form 

I = A\ A 2 ••- A n (13.5) 

so that main effects and low-order interactions are aliased with interactions of as 
high an order as possible. The actual choice of the specific ^ replicate based on 
(13.5) is often made so that it contains the treatment combination (0, 0,..., 0), 
which is often the control. 

13.3.1 The y Fraction 

In a practical situation it may be that a ^ replicate is still too large a fraction 
to be accommodated in a reasonable experimental design. Thus smaller fractions 
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may have to be considered, such as a 会 fraction of a 2 n factorial design. We 
denote such a design by 2 n ~^. The treatment combinations of a 2 n ~^ design are 
obtained by specifying i equations: 

otnxi + a\ 2 X 2 H - h ot\ n x n = 8\ 

0i2\X\ + 0122X2 H - h 0i2nX n = 82 (13.6) 


Oti\X\ + OiaX2 H - h OilnXn = ^ 

where the a- = (an ， otn, ..., oti n ) represent independent interactions E ai (i = 
1 ， 2, ...，£) (see Section 8.2) and the 5/( / = 1, 2,are either 0 or 1 mod 
2. It follows then that there are exactly 2 n ~ £ treatment combinations that satisfy 

(13.6) and these constitute the specified 2 n ~^ fractional factorial. We note the 
following: 

1. All treatment combinations satisfying (13.6) enter with the same sign into 

E^(i = 1,2, ^ i 

2. Any treatment combination that satisfies (13.6) also satisfies any linear 
combination of the equations in (13.6). 

3. Because of conditions 1 and 2 all treatment combinations satisfying (13.6) 
also enter with the same sign into any GI of the E ai (i = 1,2,, £). 


It then follows that the E ai and all their GIs are confounded with (or inseparable 
from) the mean M, so that the identity relationship is then 


五 ai+a2 = = 五 ai+a3 = 五 (*2+(X3 


— ^ai+a 2 +a 3 =...= 五 ai+o^H -…- 


(13.7) 


that is, (13.7) contains \ interactions. The alias structure is obtained by 
taking an not represented in (13.7) and multiplying it into each term in 

(13.7), that is, is confounded with 


厂 ai+ 泠 Z7«2+^ Z7 a l+ a 2+^ Z? a 3+^ rai+«2+.--+a£+^ 

M-j ^ I-j ^ ^ I-J J ^ 


(13.8) 


There are s = 2 2- ^ — 1 such interactions, say E^j(j = 1,2,... ,s) and each 
is aliased with 〆 一 1 other interactions according to (13.8). We thus have the 
complete alias structure for the 2 n ~ i design based on (13.7). 

It is, of course, clear from Eqs. (13.6) that there exist 2^ different fractions 
2 n ~^ depending on the choice of the Si(i = 1,2,, l). To specify which frac¬ 
tion we are choosing we may attach a plus or minus sign to the independent 
interactions E ai (i = 1, 2, ...，€) so that -\-E ai (—E ai ) means that we take all 
treatment combinations that enter positively (negatively) into E ai . Regardless of 
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what fraction we choose, as long as it is based on one of the possible sets of Eqs. 
(13.6), the alias structure remains the same, only the estimable linear functions 
of effects and interactions change. 

To illustrate these concepts we consider the following example. 

Example 13.1 Let n = 5 and 1 = 2, that is, we consider a ^ replicate of a 
2 5 design or a 2 5-2 design. We may specify (13.6) as 

Xi + X2 + X 3 =0 

X\ + X4 + X5 = 0 


which are satisfied by the treatment combinations 

(1), be, abd, acd, abe, ace, de, bede 
These treatment combinations also satisfy the equation 

^2 + -^3 + -^4 + -^5 = 0 

and, moreover, these treatment combinations enter negatively into ABC and ADE 
and hence positively into their GI BCDE. The identity relationship therefore is 

I = -ABC= -ADE= +BCDE (13.9) 


From (13.9) we then obtain the alias structure given in Table 13.3. It, 
together with (13.9)，also shows which parametric functions are estimable, for 
example, A — BC — DE + ABCDE, which is defined as ^[abd + acd + abe + 
ace — (1) — be — de — bede]. Had we selected the eight treatment combinations 
according to the equations 


Xi + X2 + X 3 =1 

X\ + X 4 + X5 = 0 

又2 + 义3 + 又 4 + 又5 = 1 


Table 13.3 Alias Structure 
for 2 5-2 Design 


A = 

BC = 

DE =BCDE 

B = 

AC = 

ABDE = 

: CDE 

C = 

: AB =. 

ACDE = 

BDE 

D = 

--ABCD 

=AE = 

BCE 

E = 

• ABCE 

=AD = 

BCD 

BD 

= ACD 

= ABE 

=CE 

BE 

= ACE 

= ABD : 

=CD 
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that is, 


b, c, ad, ae, bde, cde, abce, abed 
the identity relationship would have been 

I = ABC = -ADE = -BCDE (13.10) 


From the alias structure given in Table 13.3 and from (13.10) we see that now, 
for example, ABC — DE — ABCDE is estimable. □ 

Rather than specifying the identity relationship (13.7) by attaching plus or 
minus signs to the E 011 , we can indicate the specific fractional replicate by writ¬ 
ing E^!, where 5 / is the ith right-hand side in (13.6)，so that according to the 
definition of given in Section 7.4 this implies that we select all treatment 
combinations that satisfy the equation 

OiilXi + a t 2X2 H - h 0i in x n = (5/ 

(i = 1,2,..., i). We shall see that this notation is useful when we consider the 
p n - 1 system (see Section 13.5). 


13.3.2 Resolution of Fractional Factorials 

As has been mentioned earlier the value of fractional replication lies in the fact 
that often higher-order interactions are negligible. One would therefore consider 
fractions that result in the confounding of main effects and low-order interactions 
with higher-order interactions. The extent to which this is possible is expressed 
clearly by the form of the identity relationship. Basically, the minimum number of 
factors involved in an interaction included in the identity relationship determines 
the resolution of the design, that is, the extent to which main effects and low- 
order interactions are estimable under the assumption that certain higher-order 
interactions are negligible. Following the terminology introduced by Box and 
Hunter (1961a, 1961b), we distinguish between the following types of fractional 
replicates. 

Resolution III Designs No main effect is confounded with another main effect, 
but main effects are confounded with 2-factor interactions. 

Resolution IV Designs No main effect is confounded with another main effect 
or a 2-factor interaction, but 2-factor interactions are confounded with one 
another. 

Resolution V Designs No main effect is confounded with another main effect 
or 2-factor interaction, and 2-factor interactions are confounded with 3- 
factor interactions. 
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These definitions apply, strictly speaking, only to so-called regular fractional 
factorial designs for symmetrical factorials. A regular fraction is, for example, 
a 去 of a 2 〃 factorial as compared to an irregular fraction such as a 吾 of a 
factorial (see, e.g., Addelman, 1961; Webb, 1971). A more general definition was 
given by Webb (1964). 

Definition 13.1 A fractional factorial design is of resolution (2R + 1) if it 
permits the estimation of all effects up to 7?-factor interactions assuming that all 
interactions involving more than R factors are negligible (R = 1,2,...). A design 
is of resolution 2R if it permits the estimation of all effects up to (R — 1)-factor 
interactions assuming that all interactions involving (R + 1) factors are negligible 
(R = 2, 3 ,...). □ 

As a consequence of this definition, a resolution III design (i.e., R = 1) per¬ 
mits the estimation of main effects assuming that all interactions are negligible, 
and a resolution IV design (i.e .，R = 2) permits the estimation of main effects 
assuming that all 3-factor and higher-order interactions are negligible. The design 
given in Example 13.1 is a resolution III design, which we indicate by calling 
it a 2 n Y design. Note also that the lowest order interaction in the identity rela¬ 
tionship involves three factors or, using different terminology, the smallest word 
(i.e. ， interaction) contains three letters (i.e. ， factors). A 2jy ^ is given in the 
following example. 


Example 13.2 Let n = 6, l = 2, and 

I = ABCD = CDEF = ABEF (13.11) 

The 16 treatment combinations are obtained by solving the equations 

Xi + X2 + X3 + X4 = 0 
X3 + X4 + X5 + X6 = 0 


It is clear from (13.11) that main effects are confounded with 3-factor interactions. 

We can use SAS PROC FACTEX to generate this design. It is given in 
Table 13.4 ^ ^ ^ 

The output shows that the defining relationship 

I = BCDE = ACDF = ABEF 


was used, which is of the same form as (13.11). As a consequence seven of the 
fifteen 2-factor interactions are confounded with one other 2-factor interaction and 
one 2-factor interaction is confounded with two other 2-factor interactions. □ 
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13.3.3 Word Length Pattern 

In discussing the resolution of a fractional factorial design, we have referred to 
the interactions in the defining relationship as words, and the number of fac¬ 
tors involved in an interaction is called the word length. Typically, a defining 
relationship contains several words of possibly different length. This is referred 
to as the word length pattern (WLP) of the defining relationship. A 2 n ~^ frac¬ 
tion may be constructed by using different defining relationships with different 
WLPs. More precisely, a defining relationship can be characterized with respect 
to its general properties by giving the distribution of words of different length. 
Denoting the number of words with in letters, that is, an m-factor interaction, by 
L m ，we denote the WLP of an identity relationship by 


WLP(L 3 ， L 4 , … 上） 


We ignore designs with words of length 1 and 2 since they are obviously of no 
practical interest as the smallest L m ^0 determines the resolution of the design. 
For a 2 n ~ i design we then have 


m=3 



For the designs in Example 13.2 and in Table 13.4 we have WLP(0, 3, 0, 0)，and 
it is easy to see that this is the only possible WLP for a 2 6-2 design. 


13.3.4 Criteria for Design Selection 

In general, however, there may be different WLPs for a 2 n ^~ l design, that is, 
different identity relationships. In such situations the particular WLP can be used 
to distinguish among competing designs and select the one that is “best.” And 
best in our context means the largest number of estimable, that is, unconfounded, 
effects or interactions. For example, for a resolution III we would like the number 
of main effects that are confounded with 2-factor interactions as small as possible, 
or expressed differently, the number of main effects not confounded with 2-factor 
interactions as large as possible. For a resolution IV design we would like to 
have as few 2-factor interactions confounded with each other. Wu and Chen 
(1992) called a main effect or 2-factor interaction clear if none of its aliases 
are main effects or 2-factor interactions, and strongly clear if none of its aliases 
are main effects, 2-factor interactions or 3-factor interactions (see also Wu and 
Hamada, 2000). 

Another criterion to distinguish among competing designs with maximum 
possible resolution, R max say, was introduced by Fries and Hunter (1980). 
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Table 13.4 1/4 Replicate of 2 6 Factorial of Resolution IV 


options nodate pageno=l ； 
proc factex ； 
factors A B C D E F ； 
size fraction=4 ； 
model res=4 ； 

examine design aliasing confounding ； 
titlel 'TABLE 13.4'; 

title2 '1/4 REPLICATE OF 2**6 FACTORIAL'; 

title3 'OF RESOLUTION IV ; 

run ； 

The FACTEX Procedure 
Design Points 

Experiment 

Number A B C D E F 


1 

-1 

-1 

-1 

-1 

-1 

-1 

2 

-1 

-1 

-1 

1 

1 

1 

3 

-1 

-1 

1 

-1 

1 

1 

4 

-1 

-1 

1 

1 

-1 

-1 

5 

-1 

1 

-1 

-1 

1 

-1 

6 

-1 

1 

-1 

1 

-1 

1 

7 

-1 

1 

1 

-1 

-1 

1 

8 

-1 

1 

1 

1 

1 

-1 

9 

1 

-1 

-1 

-1 

-1 

1 

10 

1 

-1 

-1 

1 

1 

-1 

11 

1 

-1 

1 

-1 

1 

-1 

12 

1 

-1 

1 

1 

-1 

1 

13 

1 

1 

-1 

-1 

1 

1 

14 

1 

1 

-1 

1 

-1 

-1 

15 

1 

1 

1 

-1 

-1 

-1 

16 

1 

1 

1 

1 

1 

1 


Factor 

Confounding 

Rules 




E = B*C*D 
F = A*C*D 


Aliasing Structure 

A 

B 

C 

D 

E 


F 
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Table 13.4 (Continued) 


A*B = E*F 

A*C = D*F 

A*D = C*F 

A*E = B*F 

A*F = B*E = C*D 

B*C = D*E 

B*D = C*E 


Suppose we have two competing 2 n ~ t designs, D\ and D 2 say, of maximum 
resolution and suppose that the WLP are given as 

D\ \ WLP(L 3 , L 4 ,..., L n ) 

and 

d 2 ： wlp(z/ 3 ,z4 ， ...’l;) 

Consider then the first i for which L/ ^ ^ - (^max < i < ^)- If Li < L-, then D\ 
is said to have less aberration than D 2 , otherwise D 2 has less aberration. Among 
all competing designs the one with the smallest Li is said to be the minimum 
aberration design. 

As an illustration of these concepts, we consider the following example. 

Example 13.3 Let n = S, l = 3, and R = IV; that is, we are interested in a 
2jy 3 design consisting of 32 runs. 

Design 1 

I = ABCDEF = CDEG = ABFG 
=BDEH = ACFH = BCGH = ADEFGH 

with WLP(0, 5, 0, 2, 0, 0,). This design, as constructed by SAS PROC FACTEX, 
is given in Table 13.5. Inspection of the alias structure shows that all main effects 
are clear, but not strongly clear. Also, only 4 of the 28 two-factor interactions 
are clear (actually, strongly clear), all other 2 -factor interactions are confounded 
with other 2 -factor interactions. 

Design 2 

I = CDEF = ABDEG = ABCFG 
=ABC EH = ABDFH = CDGH = EFGH 

with WLP(0, 3, 4, 0, 0, 0). This design, given in Table 13.6, was constructed 
by SAS PROC FACTEX using the option “minabs” to produce a minimum 
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Table 13.5 l/8th Fraction of the 2 8 Factorial of Resolution IV 


options nodate pageno=l ； 
proc factex ； 

factors ABCDEFGH; 
size design=32 ； 
model res=4 ； 

examine design aliasing confounding ； 
titlel 'TABLE 13.5'; 

title2 '1/8TH FRACTION OF THE 2**8 FACTORIAL 1 ; 

title3 'OF RESOLUTION IV'; 

run ； 

The FACTEX Procedure 
Design Points 

Experiment 

Number ABCDEFG H 
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Table 13.5 (Continued) 


Factor Confounding Rules 

F = A*B*C*D*E 
G = C*D*E 
H = B*D*E 


Aliasing Structure 

A 

B 

C 

D 

E 

F 

G 

H 

A*B = F*G 
A*C = F*H 
A*D 
A*E 

A*F = B*G = C*H 

A*G = B*F 

A*H = C*F 

B*C = G*H 

B*D = E*H 

B*E = D*H 

B*H = C*G = D*E 

C*D = E*G 

C*E = D*G 

D*F 

E*F 


aberration design (L 4 = 3 in D 2 vs. L 4 = 5 in D\). All main effects are clear, of 
course, but only A and B are strongly clear. Now thirteen 2-factor interactions 
are clear, but none of them are strongly clear. □ 

Intuitively, minimum aberration designs should be the best designs with respect 
to the estimation of 2-factor interactions. However, Wu and Hamada (2000) in 
their listing of useful designs give a few examples of resolution IV designs, 
where a judicious choice of the identity relationship leads to a design with more 
clear 2 -factor interactions than the corresponding minimum aberration design (see 
also Wu and Wu, 2002). Hence, both concepts should be used when selecting a 
design for a particular application. 

Still other criteria have been introduced to further enable the search for and 
study of the properties of “desirable” designs, such as estimation capacity (Cheng, 
Steinberg, and Sun, 1999) and estimation index (Chen and Cheng, 2004). In 
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Table 13.6 l/8th Fraction of the 2 8 Factorial of Resolution IV and Minimum 
Aberration 


options nodate pageno=l ； 
proc factex ； 

factors ABCDEFGH; 
size design=32 ； 
model res=4/minabs ； 

examine design aliasing confounding ； 
titlel 'TABLE 13.6'; 

title2 '1/8TH FRACTION OF THE 2**8 FACTORIAL 1 ; 
title3 'OF RESOLUTION IV AND MINIMUM ABERRATION'; 
run ； 

The FACTEX Procedure 
Design Points 

Experiment 

Number ABCDEFG H 
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Table 13.6 (Continued) 


Factor Confounding Rules 

F = C*D*E 
G = A*B*D*E 
H = A*B*C*E 


Aliasing Structure 

A 

B 

C 

D 

E 

F 

G 

H 

A*B 

A*C 

A*D 

A*E 

A*F 

A*G 

A*H 

B*C 

B*D 

B*E 

B*F 

B*G 

B*H 

C*D = E*F = G*H 

C*E = D*F 

C*F = D*E 

C*G = D*H 

C*H = D*G 

E*G = F*H 

E*H = F*G 


many cases application of these criteria leads to the same design. For a detailed 
discussion of the relationship among these criteria, we refer the reader to Chen 
and Cheng (2004). 


13.4 FRACTIONAL REPLICATES FOR 3 n FACTORIAL DESIGNS 

The basic ideas for the 2 n ~^ design can be extended easily to the 3 n ~ l design, 
that is, the 長 fraction of the 3 n factorial. To obtain the treatment combinations 
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for such a fraction we have to specify an identity relationship containing l 
independent interactions. If we denote these interactions by E ai ， E a2 ,..., E 011 , 
we can then write 


I = Eg = Eg = … = (13.12) 
The treatment combinations are obtained by solving the equations (mod 3) 


Oi\\X\ + 0^12^2 H - h 0 i\ n x n = 8 \ 

^ 21-^1 + 仅 22 又 2 + . . . + Oi2nX n = ^2 ( 13 . 13 ) 


Oii\X\ + ai2X2 H - h OLlnXn = ^ 

For any choice of 5i, 知，… ， 如 there will be exactly 3 n ~^ treatment combinations 
satisfying (13.13). Each such set, and there will be 3 £ possible sets, constitutes 
a 3 n ~^ fractional factorial. If, for some reason, we want the fraction to contain 
the “control，” that is, the treatment combination (0, 0 , ... ， 0 )， then we choose 
= S 2 = ' •' = = 0. 

We know, of course, that any treatment combination x = (xi,X 2 , ..., x n Y sat¬ 
isfying Eqs. (13.13), that is, a-x = 5/( / = 1, 2,..., €)， also satisfies the equations 



(13.14) 


where \ = 0, 1 ， 2(/ = 1,2,..., V). For a given set of 入 1 , 入 2 ,… ，入 € (not all 
zero) let us denote ^ A./ a, by a* and / by 5*. Thus (13.14) becomes 

(a*)，x = 5* (13.15) 


Each a* corresponds to an effect E a * and the treatment combinations satisfying 
(13.15) belong to the component with 5* = 0, 1, 2. It is not difficult to 

see that, using our convention that the first nonzero element in a* is equal to 
one, there are altogether (3^ — 1)/2 such a* including a\, « 2 , •. •, 0 Lf>. And it 
is obvious that the a* other than ai, 0 C 2 , • • •, «£ correspond to all possible GIs 
among E ai ， E a2 ,..., E ai . Hence the complete identity relationship is obtained 
by adding to (13.12) all the GIs, that is, 

T — F ai — F 012 — ... — F 011 — F ai+a2 

C >1 <32 0\+02 




/ 7 ai+ 2 a 2 

〜 1 + 2 知 


,ai+2a2 + •••+2a^ 
5i+2^2+...+2^ 


(13.16) 
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In order to obtain the treatment combinations for the 3 n _ £ design, we need 
to know only (13.12) and solve Eqs. (13.13)，but in order to obtain the alias 
structure we need to use the entire set of interactions in the identity relationship as 
given in (13.16). Suppose is an effect not contained in (13.16). We know that 
because of orthogonality (see Chapter 10) exactly ^ of the treatment combinations 
satisfying (13.13) also satisfy the equation 

+ ^2 X 2 + . • . + Pn x n = ^ (13.17) 

for 8 = 0, 1, 2 and each treatment combination satisfies only one of the three 
equations. Then is defined, as with the full set of 3 〃 treatment combinations, 
by contrasts of the form 

ciE^ + c 2 E^ (13.18) 

with co + ci + C 2 = 0， where is obtained (apart from the overall mean) 
from the 3 W_ ^ _1 treatment combinations satisfying (13.17) for 5 = 0, 1, 2. Let 
us take now any of the interactions in (13.16), say , and consider the GI 
between and E ai ， denoted by E^ +oti . A contrast belonging to this effect is 
given by 

c 0 E^ ai ~^c 2 E^ ai 

where the leading mean in E^ +oti is obtained from the treatment combinations 
satisfying 

(ySi +an)xi + + a ； 2 ) x 2 + + oii„) x n = S (13.19) 

We know that 

diixi + a i2 x 2 H - h a in x n = 8i (13.20) 

Substituting (13.20) in (13.19) yields 

+ PlX2 + ••• + priori = S — 8i 

Now 8 — 8i takes on the values 0, 1, 2 (mod 3)，and it is therefore obvious that a 
contrast belonging to is also a contrast belonging to E@ +CCi , that is, and 

E Ui are confounded. The same argument holds for the other GI between and 
E ai , namely E^ +2oli . This then explains how we use (13.16) to obtain the alias 
structure for the 3 n ~^ design: Any effect not in (13.16) is confounded with 

all GIs obtained by formally multiplying into E y and E 2y where E y is any 

effect listed in (13.16)，that is, is confounded with and E^ +2y . We 

illustrate this below. 
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Example 13.4 Consider the 3 4-2 design with the identity relationship 
given by 

I = ABCl = ACD 0 = AB 2 Dl = BCD% (13.21) 

where ABC 2 and ACD are the two independent interactions of (13.12) and 
AB 2 D 2 , BCD 2 are their GIs. The treatment combinations are obtained from 
the equations 

义 1 + ^2 + 2 X 3 = 0 

X\ + X3 + X4 = 0 

They are 

0000 0112 0221 

1011 1120 1202 

2022 2101 2210 

The alias structure is given by multiplying each effect not contained in (13.21) 
into each interaction in (13.21) and its square (ignoring the subscripts). We obtain 
(using reduction mod 3): 

A = A{ABC 2 ) = A(ABC 2 ) 2 = A(ACD) = A(ACD) 2 
=A(AB 2 D 2 ) = A(AB 2 D 2 ) 2 = A(BCD 2 ) = A(BCD 2 ) 2 
=AB 2 C = BC 2 = AC 2 D 2 = CD = ABD = BD = ABCD 2 = AB 2 C 2 D 
Similarly 

B = AB 2 C 2 = AC 2 = ABCD = AB 2 CD = AD 2 = ABD 2 
= BC 2 D = CD 2 

C =AB = ABC = AC 2 D =AD = AB 2 CD 2 = AB 2 C 2 D 2 
= BC 2 D 2 = BD 2 

D = ABC 2 D = ABC 2 D 2 = ACD 2 = AC = AB 2 = AB 2 D 
二 BC =BCD 

Together with the effects in (13.21) these account for all the effects of a 3 4 
factorial. We recognize, of course, that this design is a resolution III design, which 
allows the estimation of all main effects only if all interactions are negligible. 
An equivalent design, using SAS PROC FACTEX, is given in Table 13.7. 
The factor confounding rule states that the levels of C are obtained as twice 
the levels of A plus twice the levels of B, and that the levels of D are obtained as 
the levels of A plus twice the levels of B and reduction mod 3. The output shows 
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Table 13.7 1/9 Fraction of the 3 4 Factorial of Resolution III 


options nodate pageno=l ； 
proc factex ； 
factors ABC D/nlev=3; 
size fraction=9; 
model res=3 ； 


examine design aliasing confounding ； 
titlel 'TABLE 13.7'; 

title2 '1/9 FRACTION OF THE 3**4 FACTORIAL'; 

title3 'OF RESOLUTION III'; 

run ； 

The FACTEX Procedure 


Experiment 

Number 


Design Points 

ABC 


D 


1 -1 -1 

2-10 
3-11 

4 0-1 

5 0 0 

6 0 1 

7 1-1 

8 10 

9 11 


1 

0 

1 

0 



1 

0 

0 

1 

1 

1 

0 


Factor Confounding Rules 








C = 

(2*A) + (2*B) 







D = 

A + 

(2*B) 

aliasing 

Structure 





A 

= 

(2*B) 

+ 

(2*C) 

= 

B + 

D = 

C + (2*D) 

(2*A) 

= 

B + C 

= 

(2*B) 

+ 

(2*D)= 

(2*C) + D 

B 

= 

(2*A) 

+ 

(2*C) 

= 

A + 

(2*D) = C + D 

(2*B) 

= 

A + C 

= 

(2*A) 

+ 

D = 

(2*C) + (2*D) 

C 

= 

(2*A) 

+ 

(2*B) 

= 

A + 

D = 

B + (2*D) 

(2*C) 

= 

A + B 

= 

(2*A) 

+ 

(2*D)= 

(2*B) + D 

D 

= 

A + (2*B) = (2*A) + 

C = 

B + (2*C) 

(2*D) 

二 

(2*A) 

+ 

B = A 

+ 

(2*C)= 

(2*B) + C 


that the levels of C are indeed computed in this fashion, but the levels of D are 
obtained by subtracting 1 from the results of the rule given above (presumably 
this is done to include the control in the fraction). The alias structure shows that, 
just as in Example 13.4, the main effect A is confounded with 2 d.f. of each of the 
2-factor interactions B x C, B x D, and C x D (and higher-order interactions). 






GENERAL CASE OF FRACTIONAL REPLICATION 


529 


The notions of clear effects and minimum aberration design can be carried 
over in an obvious fashion to 3 n _ £ designs. A listing of useful designs is given 
by Wu and Hamada (2000). □ 

13.5 GENERAL CASE OF FRACTIONAL REPLICATION 

13.5.1 Symmetrical Factorials 

From our discussion above it is quite obvious how the concept of fractional 
replication can be extended to the general case, that is, to the 1/〆 fraction 
of the s n factorial where s is a prime or prime power. All we need to do is 
specify i independent interactions, say E ai , E a2 ,..., E 011 and the s n ~ l treatment 
combinations are then obtained by solving the t equations 

的 1 又 1 + ^i2-^2 + ... + ai n x n = 8[ (i = 1 ， 2 ， ... ，名） 

where ot”, Xj, 8i are elements in GFO) (/ = 1, 2, y = 1,2,..., n). The 
identity relationship is given by 

j ― = E 012 = • •. = E ae = E 011 ^ 012 = £ ， a i+ M 2«2 

_ . . . _ ^ai+M 5 _iat2 _ . . . _ 五《 1+ 〜 _10?2+… (13 22) 

[uj e GF^)] that is, (13.22) contains the i independent interactions and all their 
GIs. It follows then that an effect E 01 is aliased with all the GIs between E a and 
the interactions in (13.22). 

13.5.2 Asymmetrical Factorials 

The basic ideas of fractional replication for symmetrical factorial experiments 
as presented above can be extended and applied also to asymmetrical factorials. 
Suppose we want to choose a fraction of an s 1 ^ 1 x x ... x s^ q factorial. An 
obvious and easy method would be to consider a l — in — s^s^ 2 , • • • ,Sq q repli¬ 
cate consisting of , s ; 2 —^ 2 ,... ,Sq q treatment combinations. The idea, 
of course, is to consider each of the q symmetrical component systems sep¬ 
arately, obtain a 1 /sf l fraction of the s 1 ^ 1 factorial (i = 1,2, q ) and then 
combine those fractions as a Kronecker product design. More specifically, let 
Xi = (x/i, jc/ 2 , ... ， Xi Vi ) f denote the y/ = treatment combinations for the 
fractional factorial design of the s? factorial, abbreviated FFD^/, rii, li), for 
i = 1,2,... ,q, with Xij = (xij\,Xij 2 , ..., Xij ni ) f and Xijk the level of the 众 th 
factor in the jth treatment combination of FFD(^/, n/, li). Then the treatment 
combinations for the fractional replicate of the asymmetrical factorial x x 
■■■ x Sq q are given by the symbolic direct product (see Section 11.4.1) x\ 0 X 2 ⑭ 
• • • 0 Symbolically, we may write this also as nLiX FFDfe, n u li). 
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Now each FFD ( 57 , rii, i[) is based on an identity relationship, // say, 
containing ti independent interactions and all their GFs. Thus /, determines 
the alias structure for FFD( 々， n“ ti) involving the ni factors in the facto¬ 
rial. We simply apply the rules given above. In addition, we have confounding 
among mixed interactions, that is, interactions involving factors from different 
symmetrical factorials. We note here that the alias structure for mixed interactions 
may be more advantageous than those for interactions in the symmetrical compo¬ 
nent systems, advantageous in the sense of being confounded with higher-order 
interactions that may be assumed negligible. 

We shall illustrate these ideas with the following example. 

Example 13.5 Consider a ^ replicate of the 2 3 x 3 3 factorial; that is, q = 2, 

= 2, n\ = 3, t\ = 1, ^2 = 3, ^2 = 3, ^2 = 1- Let A\, A 2 , A 3 and B\, B 2 , 
denote the factors for the two symmetrical component systems, respectively. To 
obtain FFD(2, 3, 1) we use 

h = (^1 M ^3)0 

and for FFD(3, 3, 1) we use 

h = (fil B 2 b|)o 


It follows that 



(0 

0 

0 、 


1 

1 

0 

X l = 

1 

0 

1 



1 

V 

and 





/o 

0 



1 

2 

0 


2 

1 

0 


0 

1 

1 

^2 = 

1 

0 

1 


2 

2 

1 


0 

2 

2 


1 

1 

2 



0 

2) 


The 36 treatment combinations are obtained by combining every treatment com¬ 
bination in x\ with every treatment combination in The alias structure for 
FFD(2, 3, 1) is given by 


A\ = A2A3 
M = A1A3 
A3 = A1A2 
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and for FFD(3, 3, 1) by 

B\ = == B2 

B2 = = B\B^ 

B3 = B\B2 = B1B2B3 
B1B2 = B1B3 = B2B3 

The alias structure for mixed interactions is illustrated by 

A\ x B\ = A\ x BxB^Bi, = A\ x B2B2 = A2A3 x B\ 

= A2A3 x = A2A3 x B2 

This shows that the 2-factor interaction A\ x B\ is confounded with 3-factor or 
higher-order interactions. 

An isomorphic design using SAS PROC FACTEX is given in Table 13.8. □ 


13.5.3 General Considerations 

From a practical point of view highly fractionated designs may be the only feasi¬ 
ble designs, requiring only a relatively small number of treatment combinations. 
This has, of course, fairly severe consequences as each effect will be confounded 
with a “large” number of other effects. For such a design to be useful the identity 
relationship must be chosen carefully in order to avoid having effects of interest 
confounded with each other. For example, if main effects and some or all 2-factor 
interactions are considered to be important, then certainly main effects should 
not be confounded with other main effects and 2-factor interactions. If only some 
2-factor interactions are important, then confounding of certain 2-factor interac¬ 
tions is permissible, but if all 2-factor interactions are important, then they must 
only be confounded with 3-factor or higher-order interactions. In other words, 
resolution IV or V designs are called for. It is obvious that the more effects we 
want to estimate the more treatment combinations we need. It is typical that for 
experiments having to use fractional factorial designs the two requirements of a 
limited number of treatment combinations and of information about a specified 
number of effects are often in conflict with each other and compromises will have 
to be made. It is therefore important to be aware of methods of constructing frac¬ 
tional factorial designs of minimal size to achieve certain objectives such as those 
mentioned above. This leads also to consideration of fractional replicates other 
than the regular fractions discussed so far. A general discussion of the structures 
and properties of fractional factorial designs is given by Raktoe, Hedayat, and 
Federer (1981), and a description of construction methods for certain fractional 
factorial designs is provided by Dey (1985). We shall discuss some of these 
issues in Section 13.6. 
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Table 13.8 1/6 Fraction of the 2 3 x 3 3 Factorial Using a Kronecker Product Design 

options nodate pageno=l ； 
proc factex ； 
factors A1 A2 A3; 
size design=4 ； 
model res=3 ； 

examine design confounding aliasing ； 
output out=two ； 
titlel 'TABLE 13.8'; 

title2 '1/6 FRACTION OF THE 2**3x3**3 FACTORIAL 1 ; 

title3 'USING A KRONECKER PRODUCT DESIGN'; 

run ； 

factors B1 B2 B3/nlev=3; 
size design=9; 
model res=3 ； 

examine design confounding aliasing ； 
output out=three 
designrep=two ； 
run ； 

proc print data=three ； 
run ； 

The FACTEX Procedure 
Design Points 
Experiment 

Number A1 A2 A3 


1 -1 -1 
2-11 

3 1-1 

4 11 

Factor Confounding Rules 
A3 = A1*A2 



Aliasing Structure 

A1 = A2*A3 
A2 = A1*A3 
A3 = A1*A2 
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Design Points 

Experiment 

Number B1 B2 B3 


Factor Confounding Rules 


B3 = (2*B1) + (2*B2) 


Aliasing Structure 


B1 = (2*B2) + (2*B3) 
(2*B1) = B2 + B3 

B2 = (2*B1) + (2*B3) 
(2*B2) = B1 + B3 

B3 = (2*B1) + (2*B2) 
(2*B3) = B1 + B2 
(2*B1) + B2 = B1 + (2*B3) 
B1 + (2*B2) = (2*B1) + B3 


(2*B2) + B3 
B2 + (2*B3) 


A1 A2 
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Table 13.8 (Continued) 


20 

1 

-1 

-1 

-1 

mm 

l 

21 

1 

-1 

-1 

-1 

i 


22 

1 

-1 

-1 

，: IK 

-i 

l 

23 

1 

-1 

-1 


mm 


24 

1 

-1 

-1 


l 

-l 

25 

1 

-1 

-1 

l 

-l 


26 

1 

-1 

-1 
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-l 

27 

1 

-1 

-1 
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l 

l 

28 

1 

1 

1 

-l 

-l 

-l 

29 

1 

1 

1 

-l 

mm 

l 

30 

1 

1 

1 

-l 

l 


31 

1 

1 

1 


-l 

1 

32 

1 

1 

1 


mm 


33 

1 

1 

1 

MM ： 

l 

-1 

34 

1 

1 

1 

1 

-l 


35 

1 

1 

1 

1 

mm 

-l 

36 

1 

1 

1 

1 

l 

l 


13.5.4 Maximum Resolution Design 

The discussion in Section 13.6 will allow us to make statements about the min¬ 
imum size of a design for a specified resolution. An equally important question 
arises if we are given a fixed number, say N, of runs and we want to know what 
is the maximum resolution that can be achieved under this constraint. 

It is clear, of course, that a l/s fraction of the s n factorial has resolution 
R = n. For the general case of the s n ~ l fractional factorial an upper bound of 
the resolution was given by Fujii (1976). We state his result in the following 
theorem without proof. 

Theorem 13.1 (Fujii, 1976) Let M = (s^ — 1)/(^ — 1) and n = qM + m, 
with q >0 integer. Then, for i >2, the maximum resolution Ri(n, 5 1 ) of the 
s n ~^ design (with 夕 prime or prime power) is given by 

Rl(n, s) = s^~ l q if m = 0, l 

< s l ~ x q + s l ~ 2 (s - l)(m - 1)/(/ - 1 - 1) 

if m = 2, 3, ... ， / 一 1 - 1 

< s l ~ x q + [(s - l)m/s] 


if m = /― 1 ， / 一 1 + 1， ... ， M - 1 
where [x] is the greatest integer not exceeding x. 


In Table 13.9 we present R m ax values for 5 1 = 2, 3, 4, 5 and 4 < n < 10, 2 < 
i <1 with the number of runs < 256. An asterisk indicates that R max < Rt(n, s) 
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Table 13.9 Maximum Resolution s n ~^ Designs 
s n l 尺 max Number of Runs 

2 5 2 3 8 


6 

2 

4 

16 

6 

3 

3 

8 

7 

2 

4 

32 

7 

3 

4 

16 

7 

4 

3 

8 

8 

2 

5 

64 

8 

3 

4 

32 

8 

4 

4 

16 

9 

3 

4 

64 

9 

4 

4 

32 

9 

5 

3* 

16 

10 

3 

5 

128 

10 

4 

4* 

64 

10 

5 

4 

32 

10 

6 

3* 

16 

4 

2 

3 

9 

5 

2 

3 

27 

6 

2 

4 

81 

6 

3 

3 

27 

7 

2 

5 

243 

7 

3 

4 

81 

7 

4 

3* 

27 

8 

3 

5 

243 

8 

4 

4 

81 

8 

5 

3* 

27 

9 

4 

5 

243 

9 

5 

3* 

81 

9 

6 

3* 

27 

10 

5 

5* 

243 

10 

6 

3* 

81 

10 

7 

3* 

27 

4 

2 

3 

16 

5 

2 

4 

64 

5 

3 

3 

16 

6 

2 

4 

256 

6 

3 

4 

64 

7 

3 

4 

256 

7 

4 

3* 

64 

8 

4 

4* 

256 

8 

5 

3* 

64 

9 

5 

4* 

256 

9 

6 

3* 

64 

10 

6 

4* 

256 

10 

7 

3* 

64 
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Table 13.9 (Continued) 

s n i Rmax Number of Runs 


4 

2 

3 

25 

5 

2 

4 

125 

5 

3 

3 

25 

6 

3 

4 

125 

6 

4 

3* 

25 

7 

4 

3* 

125 

8 

5 

3* 

125 

9 

6 

3* 

125 

10 

7 

3* 

125 


as given above [it shows that Ri(n, s) is not particularly good for larger values 
of l, a fact also observed by Fries and Hunter (1980) who provided a different 
bound for s = 2]. 

13.6 CHARACTERIZATION OF FRACTIONAL FACTORIAL 
DESIGNS OF RESOLUTION III ， IV， AND V 

13.6.1 General Formulation 

Recall from (12.15) that each true treatment response, a(v), for a s^ 1 x s^ 2 x 
…乂 s n q q factorial with s\ < S 2 < ' • • < s q can be expressed in terms of the mean 
response, a component from each pure partition and a component from each 
mixed partition. Of interest here are only all components from pure partitions 
belonging to main effects and 2-factor interactions and all components from 
mixed partitions belonging to 2-factor interactions. As an illustration we shall 
consider the 2 2 x 3 2 factorial. 

Let A\, A2 denote the two factors with two levels each, and B\, B2 the two 
factors with three levels. Recall from Example 12.4 that all arithmetic has to be 
done in R(6) and that the levels of A\, A2 are denoted by 0, 3 and those of B\, 
B2 by 0, 4, 2. To simplify the notation we shall write the treatment combination 
V = Oil, V12, U21. V 22 ) r as I ； = (Uj, v' 2 )' where q = (i ； n, v\2)', v 2 = O21 ， V22)' ■ 
We then have 

Cl(v) = d{V\, V 2 ) = ~h (-^i)(30)vi (*^)(03)vi + 

+ (Bi)(40)v 2 + ( 的 )(04)V2 + ( 的的 )(44)i ；2 
+ (5^B|)(42 )i ； 2 + + (A )(3004)v 

+ (^2 )(0340)1 ； + ( A ^2 )*0304) v ( 13 . 23 ) 
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Alternatively, and equivalently, we can express a(v) in terms of the traditional 
model for factorial structures as 


a(v) - fx + ai + pj + ㈣)” + Yk + S e 

+(yS)ke + (oiy)ik + (Py)jk 

+(aS) u + (pS) jC (13.24) 

with i, j = 0 ， l, k, l = 0, 1, 2. Models of the type (13.24) are used in SAS, for 
example. 

We ignore 3-factor and higher-order interactions since those are assumed 
to be negligible for resolution III, IV, and V designs. In addition, for resolu¬ 
tion III designs 2-factor interactions are also negligible, and we are interested 
only in estimating the mean and main effects. For resolution IV designs we 
are mainly interested in estimating main effects, but the mean and 2-factor 
interactions are assumed to be unknown (and not necessarily zero). For reso¬ 
lution V designs we wish, of course, to estimate the mean, main effects, and 
2-factor interactions. All three situations can be expressed in very general terms 
in the form of a three-part linear model, the first part referring to the mean, 
the second part referring to the main effects, and the third part to the 2-factor 
interactions. We formulate this as follows (see Raktoe, Hedayat, and Federer, 
1981). 

Let D denote an FFD consisting of N treatment combinations (not necessarily 
all distinct) and let denote the TV x 1 vector of the true treatment responses 
associated with D. Further, let M denote the mean, E\ the x 1 vector of main 
effect components and E 2 the 奶 x 1 vector of 2-factor interaction components 
[using either (13.23) or (13.24)], with E\ and E 2 conformable to D and the 
factorial from which D was obtained. We then write, analogously to (12.15), 

( M \ 

a D = {3,X lD ,X 2D )\EA (13.25) 

where X\d and X^d are N x q\ and N x q2 incidence matrices, respectively. 
If represents the iV x 1 vector of observations corresponding to ao, then we 
write (13.25) alternatively and in a more familiar form as 

( M \ 

E[y D ]^(0,X lD ,X 2D ) I £1 I (13.26) 


In order to determine whether a given FFD is of resolution III, IV, or V it is 
sufficient to consider, instead of D, the unreplicated design Do, which contains 
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the same treatment combinations as D except that each treatment combination 
is represented only once. Suppose Do contains Nq treatment combinations. We 
then replace (13.26), using obvious notation, by 

( M \ 

E[y Do \ = (3,X iDo ,X 2DQ ) l£i I (13.27) 

We shall now consider (13.27) and the resulting NE for each of the three 
situations. 


13.6.2 Resolution III Designs 

For a resolution III design we have, of course, ^2 = 0 and (13.27) reduces to 
the two-part model (dropping the subscript Dq for convenience) 


E{y) = (3,X x ) 



and the NE are of the form 

\X\0 x\xj \eJ = 


(13.28) 


(13.29) 


Suppose we have n factors A\, A 2 ,..., A n and factor A/ has Sf levels (i = 
1,2,, n), where the 〜 need not be all different. Then E\ has q\ = s i 
components, but for factor A/ the Si components add to zero (/ = 1,2,, n). 
Hence E\ contains only — l) = q\ — n independent components. It fol¬ 

lows then also that the coefficient matrix in (13.29) is not of full rank. A solution 
to (13.29) can be written as 


m\ _ n'3 V 
El) = X\xJ 


or, if we write X* = (3, X\), 



= (d_ X^y 


(13.30) 


(13.31) 


It is obvious then that for the mean and all main effects to be estimable we must 
have 


rank (XfX*) ^ l + qi - n (13.32) 

Condition (13.32) can in certain situations be achieved with Nq = 1 + qi — n 
treatment combinations, and we shall give some examples later (see Chapter 14). 
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For such a design no d.f. are available for estimating the variance in the 
context of the ANOVA. The design Do is then referred to as a li saturated 
design. If we use a design D with N runs where N > \ q\ — n and (13.32) is 
satisfied, then can, of course, be estimated with — (1 + — n) d.f. Using 

{X^ D X\ D )~ as the variance-covariance matrix we can then, within the factor 
space of the experiment, compare different treatments, for example, a{v) — a(w) 
for some treatment combinations v and w using the main effects model (13.28). 
For the analysis of a saturated design Do see Section 13.9. 

13.6.3 Resolution IV Designs 

For this case it is convenient to rewrite (13.27) (dropping again the subscript 

E(y) = (X u X*) ^ (13.33) 

x * 2 = (0,x 2 ) 五卜⑵ 

The reason to include M in the vector of nuisance parameters, is mostly 
one of mathematical convenience. It simply means that we are not interested in 
estimating the mean just as we are not interested in estimating 2-factor interac¬ 
tions. A plausible argument for proceeding in this fashion was given by Raktoe, 
Hedayat, and Federer (1981) who point out that because of the possible presence 
of 2-factor interactions we cannot use (13.27) in the same way as we used (13.28) 
to make comparisons among treatments. Our only aim here can be to estimate 
main effects and draw conclusions from that concerning the importance of the 
various factors. The important difference between estimating main effects from 
a resolution IV design as compared to a resolution III design is, of course, that 
in a resolution IV design 3-factor interactions are assumed to be negligible, that 
is, main effects are clear, whereas in a resolution III design 2-factor interactions 
are also assumed to be negligible. 

Returning now to model (13.33) the NE are 

/^i^i x[ x *i\ ( E i\ = ( x \y 

\Ey ~ 

Using results of Chapter 1.4 the RNE for E\ are given by 

/ V V f v* ( V*/ V *、 一 V*, V \ 17 

I A ^ A i - ^1^2 V^2 ^2/ ^2 A i I tL\ 

=(z ； - X\X* 2 (X* 2 'Xl)~ X*Ay (13.35) 


(13.34) 


Do) as 


with 
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or 

CE { = Q (13.36) 

where C and Q are defined in an obvious way. For all main effects to be estimable 
we must have 

rank (x\X { - X\X* 2 (X^XX)~ d) = [(■?,_ - l) = qi - n (13.37) 

i 

To illustrate these ideas we consider the following example. 

Example 13.6 Suppose we have four factors, A, B, C, F, each at two levels 
and we have a ^ replicate Do = {(0 0 0 0), (0 1 1 0), (1 0 1 0), (1 1 0 0), (0 0 1 
1), (0 10 1), (10 0 1), (111 1)} with eight treatment combinations. Notice that 
Do is based on the identity relationship I = ABCF. So we already know that Do 
is a resolution IV design. We now have 


= 4/4 X /2 + 2(^40A — 14) X 32^2 
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XA*/ Y ■氺 _ 

A. 2 -A 2 

(X^X * 2 ) - = 


8 Ad' n 

43,2 4 I 6 x I 2 + 2 p 6 0 f 6 -I 6 )x 0 2 ^ 2 

I °12 

- g ^12 3 /12 _ 


v7 v ■氺 / v*/v *、 一 v*/v o n 

A ^^2^2 ^2^ A 2 A i = Z J8^8 
and 

C = X[Xi - X\X* 2 (X%'X* 2 )~X* 2 Xi = 2/4 X 
It is clear then that 

rank C = 4 




and hence (13.37) is satisfied since = 8, n = 4. All main effects are estimable 
(which we knew already, of course). □ 

Let us now return to (13.36). Since C is not of full rank, we write a solution 
to (13.36) as 

E { ^CQ 

where C— is a generalized inverse of C. We also have for any estimable function 
c'Ex ‘ ^ 

var(〆 五 i) = c’C 一 co^ 

The estimable functions here are, of course, those belonging to the various main 
effects. To illustrate this we consider again Example 13.6. 


Example 13.6 (Continued) 


A generalized inverse for C is 
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■V 

A! 

Bo 

Bi 

Co 

Ci 

Fo 

_Fi_ 



1 1 -1 -1~| py(OOOO)— 

0 0 0 0 j(0110) 

1-1 1-1 j ( 1010 ) 

0 0 0 0 y(1100) 

-1 1 1-1 y ( 0011 ) 

0 0 0 0 y(0101) 

-1 -1 -1 -1 j ( 1001 ) 

0 0 0 0 」 Lv(llll) 一 
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with 


I ，-. 

var(A) = var(B) = var(C) = var(F) = ^°e 


This agrees, of course, with our earlier results. 


□ 


We noted that for resolution III designs the minimum number of distinct 
treatment combinations, that is, the number of treatment combinations in Do, is 
\ q\ — n, and designs with the number of runs can be constructed. No such 
general statement, simply based on the rank condition (13.37), can be made 
for resolution IV designs because of the presence of nuisance parameters, for 
example, 2-factor interactions. 

It is possible, however, to give a lower bound (LB) of the number of runs 
needed for a resolution IV design. For a x x • • • x Sq q factorial with 
s\ < S 2 < '' • < s q such a bound was derived by Margolin (1969) as 


LB = Sq 


1(1 




(13.38) 


(see also Seely and Birkes (1984) for a comment on its validity). Specifically for 
symmetrical factorials, that is, s n , we have the following lower bounds: 


5 LB 

2 2n 

3 3(2" - 1) 

4 4(3^ - 2) 

5 5(4n - 3) 


We can see that the design in Example 13.6 achieves this lower bound. But a 
quick look at Table 13.9 shows that only few of those designs achieve the lower 
bound, namely for 5 1 = 2, w = 8, € = 4 with 16 runs and s = 4, n = 6, 1 = 3 
with 64 runs. 


13.6.4 Foldover Designs 

Experiments are often initiated with resolution III designs because they are rel¬ 
atively small in size and provide some preliminary information about possibly 
active factors. However, the fact that main effects are not clear may make it 
sometimes difficult to interpret the results from such an experiment. One way to 
solve or partially solve this problem is to augment the original design with another 
fractional replicate such that the resulting design is a resolution IV design. For the 
2 jj^ fraction this can be achieved by using the foldover principle or producing 
a foldover design (Box and Wilson, 1951). 

The idea behind a foldover design simply is to replace in each treatment 
combination of the original fraction all 0 levels by 1 levels and all 1 levels by 0 
levels. We shall illustrate this principle in the following example. 
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Example 13.7 Consider the fraction given by 
I = ABCD = CDE = ABE 


and with the design obtained by solving the equations 


Xi + X 2 + X 3 + X 4 = 0 
X 3 -h X 4 + X 5 = 0 
Xl +X2 + X5 = 0 


This yields the design points 


(00000) (01101) 

(11110) (10011) 

(00110) (01011) 

(11000) (10101) 

The foldover design then is 

(11111) (10010) 

(00001) (01100) 

(11001) (10100) 

(00111) (01010) 


(13.39) 


(13.40) 


It is easy to see that the treatment combinations of the foldover design satisfy 
the equations 


^1 + ^2 + X 3 + X 4 = 0 
X3 + X4 + X5 = 1 
Xl +X 2 + X5 = 1 

Alternatively, we could have written (13.39) more precisely as 
I = ABCD = -CDE = -ABE 

and then the identity relationship for the foldover fraction becomes 

I = ABCD = CDE = ABE 

so that the identity relationship for the combined design is simply 

I = ABCD 

that is, we have obtained a 2jy 1 fraction. □ 

Within the practical context the augmentation of the original fraction by the 
foldover fraction results in a sequential design in that the two fractions can be 
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thought of as occurring in two different blocks. Formally this can be expressed 
by adding another factor to the original design and having that factor occur at the 
same level in the original fraction. Applying the foldover principle to the new 
design yields then the same results as described above. We illustrate this briefly 
with a continuation of Example 13.7 


Example 13.7 (Continued) Denote the new (blocking) factor by F. We then 
have a 2 6-3 fraction defined by 

I = ABCD = -CDE = -ABE = F = ABCDF = -CDEF = -ABEF 

The foldover design is thus defined by 

I = ABCD = CDE = ABE =-F = -ABCDF = -CDEF = -ABEF 

The two fractions combined are then defined by 

I = ABCD = -CDEF = -ABEF (13.41) 

hence yielding a 2jy 2 fraction. Assuming no block treatment interaction, we see 
from (13.41) that with respect to the original factors A,..., E we obtain again 
a 2jy 1 design as before. □ 

From Example 13.7 we can easily deduce the general idea associated with 
the foldover principle: Starting with a resolution III design, we know that the 
identity relationship contains one or more words of length 3 and other words 
of length > 3. Or, alternatively, the defining equations [such as (13.40)] contain 
three or more unknowns. Using the foldover principle changes the right-hand 
sides of all equations with an odd number of unknowns, but does not change the 
right-hand sides of equations with an even number of unknowns. This implies 
that the identity relationship for the final design contains only interactions with 
an even word length, the smallest of which will be 4. The foldover princi¬ 
ple is thus an easy way to achieve dealiasing of main effects and 2-factor 
interactions. 

Such dealiasing can also be achieved by constructing what we might call a 
partial foldover design. Rather than changing the level of each factor in each 
treatment combination, we may choose to change only the levels of a judiciously 
chosen subset of factors, in particular only one factor. This may have some merit 
from the practical point of view. 

It is, of course, clear that the foldover principle leads to a design with twice the 
number of design points as the original fraction. Especially for larger designs, 
that is, a large number of factors, this may be a disadvantage if not all main 
effects may need to be dealiased. Other techniques, such as optimality based 
techniques (see Wu and Hamada, 2000), may then be used instead by focusing 
on a particular model as suggested by the data from the original experiment. 
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13.6.5 Resolution V Designs 

For resolution V designs we wish to estimate the mean, main effects, and 2-factor 
interactions. Hence models (13.26) and (13.27) are appropriate. For purposes of 
our discussion we write the model for short as 



E[y] = X E 

(13.42) 

where 

X = (3,X U X 2 ) 

(13.43) 

and 

E = (M ， E\ ， E， 2 y 


with q = 

=1 + ^1 + ^2 parameters. The NE for model (13.42) is 



(X f X)E = X f y 

and we know that because of the parameterization used 

mnk(X f X) < q 

We also know that the number of degrees of freedom associated with the mean 
and all main effects and 2-factor interactions included in E is 

n 

1 + - 1) + — 1)( 〜 -1) = v(E) 

/=1 i<j 

say. For a design D to give rise to v(E) degrees of freedom for SS(X), the sum 
of squares due to fitting the model (13.42), and hence to the estimation of the 
mean, all main effects and 2-factor interactions, we must have 

rank(Z r Z) = v{E) 

A design D with v(E) runs can always be achieved, a result similar to that for 
resolution III designs. Such a design is obviously a saturated design. 

We illustrate this with the following example. 

Example 13.8 Consider n = 4 and 心 = 巧 = 幻 = 54 = 2. Then v(E) = 11, 
and it can be shown that the following (nonorthogonal) FFD allows the estimation 
of all main effects and 2-factor interactions: 


(0000) 

(1010) 

(1110) 

(0011) 

(1001) 

(1101) 

(1100) 

(0110) 

(1011) 

(1111) 

(0101) 
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13.7 FRACTIONAL FACTORIALS AND COMBINATORIAL ARRAYS 

The treatment combinations of a factorial design can conveniently be repre¬ 
sented in the form of an array or a matrix with n rows representing the factors 
Ai, A 2 ,..., A n and N columns representing the treatment combinations (runs). 
The elements of this array represent the levels of the factors, that is, if factor 
At has Sj levels, then the elements in the ith row take on values from the set 
Si = [0, 1,2,Si — 1} depending on which treatment combinations are part of 
the design. In this section we shall give an overview of several types of arrays 
and their relationship to certain types of fractional factorials. 

13.7.1 Orthogonal Arrays 

Let us consider an orthogonal array of size N ， with n constraints, .y levels, strength 
d, and index 入 , denoted by OA[A^, n, s, d\ X] (see Appendix C). From our dis¬ 
cussion above it follows that the rows represent the factors, the columns represent 
the runs, and each factor has 5 1 levels. We are considering here specifically the 
case N = s n ~ iL — Xs d . 

Rao (1947b, 1950) has shown that ford = p-\-q—l we can estimate from 
such an array main effects and interactions involving up to p factors if interactions 
involving q(q > p) or more factors are assumed to be negligible. Of particular 
interest are the following cases: 

d p q Resolution 

2 1 2 III 

3 13 IV 

4 2 3 V 

An extension of this result to asymmetrical factorial designs was given by 
Chakravarti (1956). Suppose we consider the sf 1 x x ... x Sg 8 factorial set¬ 
ting and we wish to construct a subset of size N from the complete asymmetrical 
factorial such that certain effects are estimable assuming that other effects (inter¬ 
actions) are negligible. The solution given by Chakravarti (1956) can be stated 
in the following theorem. 

Theorem 13.2 If orthogonal arrays OA[A^/, n [, si , d[ ; 入 /] (i = 1,2,g) 
exist, then the Kronecker product array of the g orthogonal arrays above yields 
a fractional factorial designs in 

S 8 d . 

n = n Ni= n Xi sf i 

i=l i=l 1 

runs. If the /th orthogonal array is of strength d[ = pt qi — \ (with pt , qi 
as defined above), then in this Kronecker product array all main effects and 
interactions involving up to r = Xlf =1 n (0 < r/ < 0 < r < ^ pt) factors are 

estimable, where r/ factors are chosen from the ith group of factors at Si levels 

(/ = 1, 2, ... ， g). 
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As an illustration of this procedure we consider the following example. 

Example 13.9 (Chakravarti, 1956) Suppose we have three factors A\, A 2 , 
A 3 with two levels each and four factors B\, B 2 , B 4 with three levels each; 
that is, we have the 2 3 x 3 4 asymmetrical factorial. There exist resolution III 
designs for the individual systems with 4 and 9 runs, respectively, given by 
OA[2 2 , 3, 2, 2; 1] 



l 

2 

3 

4 

Ai 

0 

1 

1 

0 


0 

1 

0 

1 


0 

0 

1 

1 


and OA(3 2 ,4,3,2; 1) 



1 

2 

3 

4 

5 

6 

7 

8 

9 

Bx 

0 

0 

0 

1 

1 

1 

2 

2 

2 

b 2 

0 

1 

2 

1 

2 

0 

2 

0 

1 

Bi 

0 

2 

1 

1 

0 

2 

2 

1 

0 

Ba 

0 

1 

2 

0 

1 

2 

0 

1 

2 


which correspond to the identity relationships 

/ = A1A2A3 
and 

I = B1B2B2 = B2B2B4 = BxB^B^ = B i 

respectively. The Kronecker product design is then obtained by combining every 
column of OA[2 2 , 3, 2, 2; 1] with every column of OA[3 2 , 4, 3, 2; 1] resulting 
in an array of n\ n 2 = 1 rows and A/"iA ^2 = 4 • 9 = 36 columns. Since p\ = 
P2 = 1 and q\ = q: = 2, it follows that in addition to the main effects only 
2-factor interactions involving one factor A/ (/ = 1, 2, 3) and one factor Bj(j = 
1, 2, 3, 4), each with 2 d.f” are estimable assuming that all other interactions are 
negligible. □ 

This method can produce a large number of asymmetrical fractional factorials, 
but, as is the case with many Kronecker product designs, the number of runs 
becomes quite large. For this reason we mention only a few other special cases 
that lead to useful designs with the same properties as the design in Example 13.9: 

1. For the 2 n - 5 1 complete factorial (3 < n < 7) combine each level of the 
factor with s levels with each column of the array OA[ 8 , n, 2, 2; 2] to 
obtain a design with 8 s runs. 

2. For the 2 n - s complete factorial (1 < n < ll) combine each level of the 

factor with s levels with each column of the array OA[12, n, 2, 2; 3] to 

obtain a design with 12 s runs. 
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3. For the 2 ni - 3 ni complete factorial with 3 < n\ <1 and 1 < ^2 < 4 obtain 
the Kronecker product design of OA[8, n\, 2, 2; 2] and OA[3 2 , n 2 , 3, 2; 1] 
with 72 runs. 

For a more general characterization of designs in the above classes as well as 
other factor combinations, see Chakravarti (1956). 

13.7.2 Balanced Arrays 

Just as orthogonal arrays are related to fractional factorial designs, so are balanced 
arrays (see Appendix C). 5-arrays of strength 2, 3, 4 represent resolution III, IV, 
V designs, respectively. As an illustration consider the following example. 

Example 13.10 BA(5, 4, 2, 2; (1, 1, 2)) is given by 

/0 0 1 1 l\ 

0 10 11 
#=01101 
\0 1 1 1 0/ 

This represents a main effect plan for four factors in five runs. □ 

However, contrary to fractional factorial designs in the form of orthogonal 
arrays, the designs from 5-arrays are nonorthogonal, but as shown by Chakravarti 
(1956) and Srivastava (1965) the variance - covariance matrix for estimates of the 
parameters is balanced. For Example 13.10 this means that var(A / 上 is 上 he same 
for each / = 1, 2, 3, 4, cov(/x, A/) is the same for each i, and cov(A/, Aj) is the 
same for each pair (/, j) with i _ j . 

In a series of studies Srivastava and Chopra (1971) and Chopra and Srivastava 
(1974, 1975) have provided methods of constructing B-arrays of strength 4 and a 
list of resolution V designs for 4 < n < 8 factors and a wide range of N runs. 

13.8 BLOCKING IN FRACTIONAL FACTORIALS 
13.8.1 General Idea 

As we have discussed, the purpose of FFDs is to reduce the number of experi¬ 
mental runs consonant with the objectives of the experiment. That may still leave 
us with a relatively large number of treatment combinations such that in the prac¬ 
tical setting some form of blocking may be required. To achieve such blocked 
designs we simply need to combine the ideas of systems of confounding and 
fractional factorials. The concept can be described in general terms as follows. 

Let us consider an unreplicated s n ~^ fractional factorial to be used in s m 
blocks. We then know the following: 

1. We have s n ~^ treatment combinations. 

2. The available treatment combinations are determined by i independent 
interaction components in the identity relationship. 



550 


FRACTIONAL FACTORIAL DESIGNS 


3. We can estimate {s n ~ i — 1)1 {s — 1) functions of main effects and interac¬ 
tion components. 

4. The estimable functions are determined by the identity relationship in the 
form of the alias structure. 

5. We now have s m blocks of size 

6 . We have s m — l d.f. among blocks. 

7. We need to confound (s m — l)/(s — 1) interaction components with blocks. 

8 . From the available estimable functions (as given by the alias structure) we 
need to select m functions to confound with blocks. 

9. Then all their generalized interactions (and aliases) are also confounded 
with blocks. 

We shall illustrate these steps and some other ideas in the following sections with 
particular reference to the 2 n and 3 n systems. 

13.8.2 Blocking in 2 n ~ l Designs 

Consider a 2 n ~ l FFD defined by the identity relationship (see Section 13.3.1) 

/ = E ai = E ai = • . . = E a£ = £； ai+a 2 = • . •= 五 ai+a2 + …+叫 (13.44) 

where E ai , E a2 ,..., E ae are the independent interactions. We shall denote the 
2 n ~ l — l = q estimable effects by E 择 2 ,..., E^. The alias structure is 

obtained by multiplying each E^(i = 1,2,... ,q) into (13.44). 

We now consider placing the treatment combinations specified by (13.44) into 
2 m blocks of size To accomplish this we need to select m effects from 

among the E〜(i = 1,2,q) to confound with blocks, say, without loss of 
generality, E^, E ^ 2 ,..., E^ m . We know that then all their generalized interac¬ 
tions are also confounded with blocks, giving rise to the required 2 m blocks. 

To generate the intrablock subgroup (IBS) we combine the i equations for 
generating the fraction specified by (13.44) and the m equations for the specified 
system of confounding, that is, 

OC\\X\ + Oi\2X2 H - h OiinXn = 0 

0^21^1 + 0122X1 H - h OtlnXn = 0 

Oiiixi + a ， aX2 H - h ai n x n = 0 

Pl\Xi + P\2 x 2 + … + Pln x n = 0 (13.45) 

^ 21^1 + P22X2 + . . . + PlnXn = 0 


+ ^m2^2 + . . . + ^mn^n = ^ 
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Equations (13.45) show that the IBS constitutes a 2 n ~^ +m ^ fraction, with the 
remaining blocks representing the other fractions obtained by changing succes¬ 
sively the right-hand sides of the second set of equations in (13.45). 

In the practical context the interactions E^(i = 1, 2,..., m) would have 
to be chosen carefully so as to avoid the confounding of “important” effects 
with blocks. We shall illustrate the preceding procedure with the following 
example. 


Example 13.11 Consider the 2 7-3 design in four blocks. We start with the 
resolution IV design defined by the identity relationship 

I = ABCD = CDEF = ABEF = BCEG = ADEG = BDFG = ACFG 


where the three independent interactions are underlined. The 15 estimable func¬ 
tions (d.f.) are accounted for by seven main effects (+ 3-factor and higher-order 
interactions), seven 2-factor interactions (+ higher-order interactions), and one 
3-factor interaction (+ higher-order interactions). In order to set up blocks of 
size 4 we need to confound three 2-factor interactions (and their aliases), choos¬ 
ing, say, AB and AC as the two independent interactions. Thus, to generate the 
design we chose in (13.45) 

< = (1 1 1 1000 ) a f 2 = ( 0011110 ) a ' 3 = (01 10101 ) 

= (1 100000) )8； = (1010000) 

An isomorphic design using SAS PROC FACTEX is given in Table 13.10. □ 

13.8.3 Optimal Blocking 

As we have mentioned earlier, there may be various ways to set up the blocks 
for a particular fractional factorial, for example, by choosing different )8 ; to use 
in (13.45). Such choices for the 2 n ~ l fraction in 2 m blocks may be made by 
first inspecting the alias structure for the 2 n ~ l and then selecting m appropriate 
interactions to generate the system of confounding. We obviously need to be 
careful in this selection so that some of the resulting 2 m — m — \ generalized 
interactions do not end up being desirable effects. 

A more formal approach to this problem makes use of the fact that blocking in 
a 2 n ~ l can be considered as a special case of fractionation (Kempthorne, 1952; 
Lorenzen and Wincek, 1992) by introducing additional factors as blocking factors. 
More specifically, a 2 n ~^ fractional factorial in 2 m blocks of size < 

n — V) can be considered formally as a 2( w+/n ) _ ( £+m ) (Chen and Cheng, 1999). 
The factors are divided into n treatment factors A\, A 2 ,..., A n and m blocking 
factors b\, b 2 , … ， b m . The 2 m “level” combinations of the blocking factors are 
used to divide the 2 n ~^ treatment combinations into 2 m blocks. By treating this 
as a special case of a fractional factorial, the identity relationship will contain two 
types of words: treatment-defining words are those that contain only treatment 
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Table 13.10 2 7-3 Design in 4 Blocks 


options nodate pageno=l ； 

proc factex ； 

factors A B C D E F G; 

size design=16 ； 

model res=4 ； 

blocks size=4 ； 

examine design aliasing(3) confounding ； 
output out=design 

blockname=block nvals=(1 2 3 4); 
titlel 'TABLE 13.10'; 

title2 '2** (7-3) DESIGN IN 4 BLOCKS'; 
run ； 

proc print data=design ； 
run ； 


The FACTEX Procedure 
Design Points 


Experiment 

Number A B C D E F G Block 


1 

2 

3 

4 

5 

6 

7 

8 
9 

10 

11 

12 

13 

14 

15 

16 


-1 

1 

1 

-1 

-1 

1 

1 

-1 

-1 

1 

1 

-1 

-1 

1 

1 


Factor Confounding Rules 


E = B*C*D 
F = A*C*D 
G = A*B*D 


4 

1 

1 

4 

3 

2 

2 

3 

3 

2 

2 

3 

4 
1 
1 
4 
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Table 13.10 (Continued) 




Block Pseudofactor Confounding Rules 







[Bl] = A*B*C*D 









[B2] = C*D 




Aliasing : 

Structure 







A = 

B*D*G 

= 

B*E*F = C*D*F = 

C*E*G 





B = 

A*D*G 

= 

A*E*F = C*D*E = 

C*F*G 





C = 

A*D*F 

= 

A*E*G = B*D*E = 

B*F*G 





D = 

A*B*G 

= 

A*C^ 

= B*C*E = 

E*F*G 





E = 

A*B*F 

= 

A*C*G = B*C*D = 

D*F*G 





F = 

A*B*E 

= 

A*C*D = B*C*G = 

D*E*G 





G = 

A*B*D 

= 

A*C^ 

= B*C*F = 

D*E*F 




[B]= 

A*B 

=D*G 


E*F 







A*C 

=D*F 

= 

E*G 







A*D 

=B*G 

= 

C*F 






[B]= 

A*E 

=B*F 

= 

C*G 






[B]= 

A*F 

=B*E 

= 

C*D 







A*G 

=B*D 

= 

C*E 







B*C 

=D*E 

= 

F*G 







A*B*C = A*D*E = 

A*F*G = B*D*F = B*E*G 

= 

C*D*G = 

C*E*F 

Obs 


block 


A 

B C 

D 

E 

F 

G 

1 


1 


-1 

-1 -1 

1 

1 

1 

1 

2 


1 


-1 

-1 1 

-1 

1 

1 

-1 

3 


1 


1 

1 -1 

1 - 

1 

-1 

1 

4 


1 


1 

1 1 

-1 - 

1 

-1 

-1 

5 


2 


-1 

1 -1 

1 - 

1 

1 

-1 

6 


2 


-1 

1 1 

-1 - 

1 

1 

1 

7 


2 


1 

-1 -1 

1 

1 

-1 

-1 

8 


2 


1 

-1 1 

-1 

1 

-1 

1 

9 


3 


-1 

1 -1 

-1 

1 

-1 

1 

10 


3 


-1 

1 1 

1 

1 

-1 

-1 

11 


3 


1 

-1 -1 

-1 - 

1 

1 

1 

12 


3 


1 

-1 1 

1 - 

1 

1 

-1 

13 


4 


-1 

-1 -1 

-1 - 

1 

-1 

-1 

14 


4 


-1 

-1 1 

1 - 

1 

-1 

1 

15 


4 


1 

1 -1 

-1 

1 

1 

-1 

16 


4 


1 

1 1 

1 

1 

1 

1 


factors, and block-defining words are those that contain at least one blocking 
factor (Chen and Cheng, 1999). We shall illustrate this in the following example. 


Example 13.12 Consider the 2 7 — 3 in 2 2 blocks as a 2 ( 7+2 )_( 3+2 ) fraction. We 
need three independent treatment-defining words, say A 2 A 3 A 4 A 5 , AiA 3 A 4 A 6 , 
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and AiA 2 A 4 A 7 , and two independent block-defining words, say A 1 A 2 A 3 A 4/71 
and A 3 A 4 Z 72 - To obtain the desired design we thus need to solve the following 
equations: 


X2 + A + X4 + X5 

x\ + X 3 + X 4 + xe 

X\ + X2 + X 4 + Xj 

XI + X2 + X3 + X4 
X 3 + X 4 


+ ^1 


= 0 
= 0 
= 0 
= 0 

+ ^2 = 0 


(13.46) 


where in (13.46) xi, ^ 2 ,..., X 7 refer to the treatment factors and y\, refer to the 
block factors, with xi = 0, 1, (/ = 1, 2,..., 7), yj = 0, l(j = 1,2). The solution 
of Eqs. (13.46) leads to the design given in Table 13.10 if we use the following 
association between block numbers and block factor level combinations: 


Block No. 

yi 

yi 

1 

1 

1 

2 

0 

1 

3 

1 

0 

4 

0 

0 


Inspection of the identity relationship in Example 13.12 illustrates some of 
the difficulties encountered in extending the notions of resolution and minimum 
aberration to situations involving block-defining words. The presence of a three- 
letter word, for example, would suggest that the design is a resolution 

III design, when in fact it is clearly a resolution IV design with regard to the 
treatment factors. Similar problems have been addressed and solutions proposed 
by, for example, Bisgaard (1994), Sitter, Chen, and Feder (1997), and Sun, Wu, 
and Chen (1997)，but as Chen and Cheng (1999) point out, none of these solu¬ 
tions are satisfactory with regard to properly characterizing blocked designs and 
identifying optimal blocking schemes. 

The problem is that there are two different word length patterns, one for 
treatment-defining words and one for block-defining words. Suppose we consider 
only designs where no main effects are confounded either with other main effects 
or block effects. Following Sun, Wu, and Chen (1997), let L，，o be the number 
of treatment-defining words containing i treatment letters, and let L/，i be the 
number of block-defining words also containing i treatment letters and one or 
more block letters. We then have, for a given design, two types of word length 
patterns, one for treatment-defining words 


WLP ? = (Z>3,o, [4,0,…) 


(13.47) 
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and one for block-defining words 

WLP^ = (L 2 ,i,L 3 ,i,...) (13.48) 

Clearly, (13.47) and (13.48) have different meanings and are of different 
importance. An intuitive method to proceed would be to search for a design 
that has minimum aberration (see Section 13.3.4) with respect to both treatment 
and block factors. Zhang and Park (2000), however, have shown that no such 
design exists. Instead, in order to extend the concept of minimum aberration we 
somehow may need to combine (13.47) and (13.48). To this end Chen and Cheng 
(1999) propose the following ordering, based on the hierarchical assumption that 
lower-order effects are more important than higher-order effects and that effects 
of the same order are of equal importance: 

(L3,o, ^2,l), [4,0, (^5,0, ^3,l), ^6,0, (^7,0, ^4,l) [8,0, … （ 13.49) 


This means, for example, that if L 3 ,o > 0 some 2-factor interactions are aliased 
with main effects, and if L〗，i >0 then some 2 -factor interactions are confounded 
with blocks. In either case those 2-factor interactions cannot be estimated. Further, 
designs with L 3 0 > 0 and/or L 2,1 > 0 are clearly less desirable than designs with 
^ 3,0 = ^ 2,1 =0 and L^o > 0. 

Based on such considerations, the criterion of minimum aberration for 
blocked fractional factorial designs might be defined by sequentially mini¬ 
mizing 乙 3 , 0 ，乙 2 ， 1 ，乙 4 , 0 ，乙 5 , 0 ，乙 3 ， 1 ，乙 6 , 0 ，乙 7 , 0 ，乙 4 ， 1 ，乙 8 , 0 , ... • But Chen 
and Cheng (1999) show that even that is not quite satisfactory. Based on 
the notion of estimation capacity [for details see Cheng, Steinberg, and Sun, 
(1999)]，they propose to minimize sequentially what they call the blocking 
word length pattern: 



WLP 6 = (Z^ ， L b 4 ” . 

L b \ 

， ^n+[n/2\ ) 

(13.50) 

where 

L J.O 

for even j < n 


l j = ■ 

((J. + 1)/2) ~,o + ^0+l)/2,l 

for odd j < n 



[ Lj-[ n / 2 i\ for w + 1 < y < « + [n/2] 

(13.51) 

and [n/2] is the largest integer less than or equal to n/2. As can be seen 
from (13.51) the word length for odd j < n is a. linear combination of the 
two associated word lengths enclosed in parentheses in (13.49). For example, 
L\ = 3L3,o + [ 2 ，i and = 10 L5 ,o + [ 3 , 1 . 

For detailed methods of identifying minimum aberration blocked fractional 
factorial designs we refer the reader to Chen and Cheng (1999). In Tables 13.1 1 ， 
13.12, and 13.13 we give, in a different form, some of their results for 8-, 
16- ， 32-run minimum aberration 2 n ~ l designs in 2 m blocks for w = 4, 5, … ， 10; 
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Table 13.11 8-Run Minimum Aberration 2 n ~^ Designs 





Independent Defining Words 




n 


t 

m Treatments 

Blocks 


WLP ， 

4 


1 

1 ABCD 

ABbi 


2 

10 

4 


1 

2 ABCD 

ABb\, ACb2 


6 

10 

5 


2 

1 ABD,ACE 

BCbi 


8 

1 2 

6 


3 

1 ABD,ACE,BCF ABCb\ 


15 3 4 

a Giving only L 3 , Lj, L 

b 

5. 





Table 13.12 16-Run Minimum Aberration 2 n ~^ Designs 







Independent Defining Words 




n 

i 

m 

Treatments 

Blocks 

wuv 

a 

5 

1 

1 

ABCE 

ABDb\ 

0 

1 

2 

5 

1 

2 

ABCE 

ABbi,ACDb 2 

2 

1 

4 

5 

1 

3 

ABCE 

ABb i, ACb2, ADb^ 

10 

1 

0 

6 

2 

1 

ABCE,ABDF 

ACDbi 

0 

3 

4 

6 

2 

2 

ABCE,ABDF 

ABbi,ACDb 2 

3 

3 

8 

6 

2 

3 

ABCE,ABDF 

ABb\, ACb2, ADb^ 

15 

3 

0 

7 

3 

1 

ABCE,ABDF, 

ACDG 

BCDbi 

0 

7 

7 

7 

3 

2 

ABCE,ABDF, 

ACDG 

ABb\,ACb2 

9 

7 

0 

7 

3 

3 

ABCE,ABDF, 

ACDG 

ABb\, ACZ? 2 , ADb^ 

21 

7 

0 

8 

4 

1 

ABCE,ABDF ， 

ACDG,BCDH 

ABbi 

4 

14 

0 

8 

4 

2 

ABCE,ABDF, 

ACDG,BCDH 

ABb\,ACb2 

12 

14 

0 

8 

4 

3 

ABCE,ABDF, 

ACDG,BCDH 

ABb\, ACZ? 2 , ADb^ 

28 

14 

0 

9 

5 

1 

ABE,ACF, 

ADG,BCDH, 

ABCDJ 

BCb u 

16 

14 

84 

9 

5 

2 

ABE,ACF, 

ADG,BCDH, 

ABCDJ 

BCb\, BDb2 

24 

14 

92 

10 

6 

1 

ABE,ACF, 

BCG’ADH ， 

BCDJ,ABCDK 

BDbi 

28 

18 

96 

10 

6 

2 

ABE,ACF, 

ABCbi,BDb 2 

37 

18 184 




BCG’ADH ， 

BCDJ,ABCDK 






Giving only L 3 , Lj, l\. 
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Table 13.13 32-Run Minimum Aberration 2 n ~^ Designs 





Independent Defining Words 



n 

£ 

m 

Treatments 

Blocks 

wuv 

6 

1 

1 

ABCDEF 

ABCbi 

0 

0 2 

6 

1 

2 

ABCF 

ABDbi,ACEb 2 

0 

1 4 

6 

1 

3 

ABCDEF 

ABb\, CDb2, ACEb^ 

3 

0 8 

6 

1 

4 

ABCDEF 

ABb\,ACb2, 

ADb^,AEb\ 

15 

0 0 

7 

2 

1 

ABCF,ABDEG 

ACDb\ 

0 

1 22 

7 

2 

2 

ABCF, ABDG 

ACDbx,ABEb 2 

0 

3 7 

7 

2 

3 

ABCF, ABDEG 

ACb\, ABDb:, 

ABEb 3 

5 

1 32 

7 

2 

4 

ABCF,ADEG 

ABb\,ACb2 

ADb^,AEb\ 

21 

2 0 

8 

3 

1 

ABCF, ABDG ， 
ACDEH 

ABEb\ 

0 

3 43 

8 

3 

2 

ABCF, ABDG, 
ACDEH 

ABEbi,BCDEb 2 

1 

3 50 

8 

3 

3 

ABCF, ABDG ， 
ACEH 

ABbuACDb:, 

AEb 3 

7 

5 18 

8 

3 

4 

ABCF 1 ， ABDG ， 
ACEH 

ABb\, ACb2, 
ADb^.AEb^ 

28 

5 0 

9 

4 

1 

ABCF, ABDG ， 
ABEH,ACDEJ 

BCDEbi 

0 

6 84 

9 

4 

2 

ABCF, ABDG ， 
ACEH,ADEJ 

BCbi,BDEb 2 

2 

9 14 

9 

4 

3 

ABCF, ABDG ， 
ACEH,ADEJ 

ABb\, ACDb2, 

AEb^ 

9 

9 27 

9 

4 

4 

ABCF, ABDG ， 
ACEH,ADEJ 

ABb\,ACb2, 

DEb3 ， AEb4 

36 

9 0 

10 

5 

1 

ABCF, ABDG ， 
ACEH,ADEJ, 
ABCDEK 

ACDbi 

0 

15 10 

10 

5 

2 

ABCF, ABDG ， 
ACEH,ADEJ, 
ABCDEK 

ABb u ACDb 2 

3 15 20 

10 

5 

3 

ABCF, ABDG ， 
ACDH ， ABEJ ， 
ACER 

ABb\,ACb2, 

ADEb 3 

12 16 36 

10 

5 

4 

ABCF, ABDG ， 
ACEH,ADEJ, 
ABCDEK 

ABb\,ACb2 

ADb^,AEb\ 

45 

15 0 


Giving only L 3 , L b A , L b 5 . 
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t = 1,2 •. • ， 6; m = 1, 2, 3, 4. For possible combinations of n,i, and m we give 
the independent words for the identity relationship from which the design can 
be obtained by solving a set of equations as illustrated in Example 13.12. We 
also list the first three components, L^, and of each associated WLP^. 
In Tables 13.11 ， 13.12, and 13.13 we denote the treatment factors by A, B, C, 
D, E, F, G, H, J, K and the blocking factors by b\, b 2 , b、, b^. 

13.9 ANALYSIS OF UNREPLICATED FACTORIALS 

One reason for using fractional factorial designs is to reduce the number of runs 
in the experiments. But even a highly fractional design for a large number of 
factors may involve quite a large number of runs. In that case we may be able 
to afford only a few or no replications of the treatment combinations included in 
the design. As we know replications are important for estimating the variance of 
the estimable effects and interactions. 

For unreplicated 2 n factorials we have described how negligible interactions 
can be used to estimate (see Section 7.6.6). This method is, of course, applica¬ 
ble also for the general s n factorial and mixed factorials. The problem becomes 
more complicated for fractional factorials where often only main effects and 
low-order interactions are estimable. In this case we have to fall back on the 
assumption of effect sparsity, which says that typically only a few effects and/or 
interactions are real. We shall mention here briefly the essential ideas of two 
methods that are helpful with the analysis of data and the interpretation of the 
results from unreplicated fractional factorials. 

13.9.1 Half-Normal Plots 

Let us consider a 2 n ~^ fraction and suppose that the effects and interactions E ai , 
E 012 , .. E am (m = 2 n ~^ — 1) are estimable (these are, of course, aliased with 
other interactions which we assume to be negligible). Noting that | E ai | = 

I E? — Eq 1 I can be viewed as the range of a pair of (approximately) nor¬ 
mally distributed random variables suggests that the | E ai \ are themselves half- 
normally distributed, that is, follow the distribution function 

fix) = [2/(jt CT e 2 )] 1/2 exp[-x 2 /(2 a])\ for x > 0 
= 0 for x < 0 

Its cdf can be linearized by using the upper half of normal probability paper, 
relabeling the P axis for P > .5 as P f = 2P — 1. Based on this idea Daniel 
(1959) proposed a procedure that has come to be known as half-normal plot. 

It consists of plotting the absolute values of the m estimates, approximating 
their jP’ values by P f = {i — j)/m for / = 1, 2,, m. If，as assumed, most of 
the estimates are, in fact, estimating zero, then they should fall near a straight 
line through the origin. The 68% point (obtained from P f = 2 x .84 — 1 = .68) 
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would provide an approximate value of the standard error, say SEo, of the effects 
plotted, that is, an estimate of since SE(E ai ) = a e /(2 n ~^~ 2 ) 1 ^ 2 . Furthermore, 
to the extent that some of the largest estimates deviate from the straight line is 
an indication that the corresponding effects are different from zero. 

The above describes in essence the procedure, but we need to elaborate on 
two points: (1) How do we judge an estimate to be significantly different from 
zero? and (2) should we reestimate in light of some significant estimates? 

1. The significance of effects is judged by whether they fall outside certain 
“guardrails” (Daniel, 1959). The a-level guardrails are constructed by draw¬ 
ing a line through the a-level critical values for the largest standardized 
absolute effects, say th(h = m, m — 1,..., m — K), standardized by the 
initial estimate of o e . Zahn (1975) provided the critical values for m = 15, 
31, 63, 127 and a = .05, .20, .40. In Table 13.14 we give the critical values 
for m = 15, 31 and a = .05, .20, .40 for some 

2. Based upon the test performed by using the procedure in point 1 above, 
we exclude not only the significant effects at a = .05 but also “borderline” 
effects at a = .20 or even a = .40, say there are K a such effects altogether, 
and replot the m — K a remaining effects. The value at P’ = .68 provides 
then the final value for SE(E ai ), say SEf, and from it we obtain the final 
estimate of o e [for a more refined explanation see Zahn (1975)]. 


The half-normal plot procedure is illustrated in the following example. 


Example 13.13 The following data are taken from Example 10.4 in Davies 
(1956), which describes a 2 5 — 1 factorial investigating the quality of a basic 
dyestuff. The factors are A (temperature), B (quality of starting material), C 


Table 13.14 Guardrail Critical Values 


m = 15: 

h 

of = .05 

a = .20 

a = .40 


15 

3.230 

2.470 

2.066 


14 

2.840 

2.177 

1.827 


13 

2.427 

1.866 

1.574 


12 

2.065 

1.533 

1.298 

m = 31: 

h 

of = .05 

a = .20 

a = .40 


31 

3.351 

2.730 

2.372 


30 

3.173 

2.586 

2.247 


29 

2.992 

2.439 

2.121 


28 

2.807 

2.288 

1.991 


27 

2.615 

2.133 

1.857 
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(reduction pressure), D (pressure), and E (vacuum leak). The |-fraction is based 
on / = —ABCDE. The treatment combinations and observations Y are given in 
Table 13.15. ^ 

Pertinent computed values to draw the half-normal plot are given in Table 
13.16, and the half-normal plot is given in^Figure 13.1. 

From Table 13.16 we see that SEo = BC = 8.31, the llth-order statistic, is 
closest to the .68 percentile and hence is the initial estimate of SE (云 ai ). The 
standardized effects are then obtained as t[ =| E ai \ /8.31. The guardrails for 
a = .05, .20, and .40 are constructed by drawing lines through the corresponding 


Table 13.15 2 5-1 Fractional Factorial 



Treatment 

Y 

Treatment 

Y 


00000 

201.5 

00011 

255.5 


10001 

178.0 

10010 

240.5 


01001 

183.5 

01010 

208.5 


11000 

176.0 

11011 

244.0 


00101 

188.5 

00110 

274.0 


10100 

178.5 

10111 

257.5 


01100 

174.5 

01111 

256.0 


11101 

196.5 

11110 

274.5 

Table 13.16 Calculations for Half-Normal Plot of Figure 13.1 

Effect 

E ai 

Order(/) 

(i - .5)/15 

1 t t 

DE 

- .0625 

1 

•0333 

.00752 

A 

.4375 

2 

.1000 

.05263 

AE 

-2.3125 

3 

.1667 

.27820 

AC 

3.0625 

4 

.2333 

•36842 

BD 

-3.5625 

5 

•3000 

.42857 

E 

3.9375 

6 

.3667 

.47368 

CE 

-4.6875 

7 

.4333 

.56391 

AD 

5.1875 

8 

.5000 

.62406 

B 

-7.5625 

9 

.5667 

.90977 

BE 

7.6875 

10 

.6333 

.92481 

BC 

8.3125 

11 

•7000 

1.00000 

C 

14.0625 

12 

.7667 

1.69173 

CD 

14.3125 

13 

.8333 

1.72180 

AB 

16.6875 

14 

.9000 

2.00752 

D 

66.6875 

15 

.9667 

8.02256 
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i 

o 

0.045 0.136 0.227 0.318 0.409 0.500 0.591 0.682 0.773 0.864 0.954 

Percentile 

Figure 13.2 Half-normal plot for nonsignificant effects. 

critical values for ?i 3 , and t\ 2 . From Figure 13.1 we see that for a = .05 

only D is significant, for a = .20 D and C are significant, and for a = .40 D, C, 
CD, and AB are significant. Using the latter result, we obtain the final estimate 
of a e from the .68 percentile of the half-normal plot for the 11 smallest effects in 
Figure 13.2. From it we see that AD = 5.188 = SE(E ai ) = a'^/(2 5_1_2 ) 1 / 2 = 
SEy, and hence a e = 2 x 5.188 = 10.376. □ 


2 


7 







6 5 4 3 

lo a)3=LlJa) 3losqv 
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13.9.2 Bar Chart 

This procedure, proposed by Lenth (1989)，is based on a simple formula for the 
standard error of the estimated effects. The effects are plotted and the usual t 
procedures are used to judge significance and interpret the results. 

Let 

50 = 1.5 x median | E Ui \ 

and define the pseudostandard error (PSE) of the estimated effect by 

PSE= 1.5 x r-median | E ai \ (13.52) 

where r-median is the restricted median over all | E ai \ < 2.55 , o* PSE (corre¬ 
sponding to SEf in Section 13.9.1) in (13.52) is an estimate of cr e /(2 n ~^~ 2 ) 1 ^ 2 . 
It can be used to define two useful quantities that in turn can be used to assess 
the significance of effects. 

Let 

ME = ? 9 75 ,j x PSE (13.53) 

denote the margin of error for E ai , where ? 975 ^ is the .975th quantile of the t 
distribution with d = m/3 d.f. (m = 2 n ~^ — 1). An approximate 95% confidence 
interval for E ai is then given by E ai 士 ME. A simultaneous margin of error 
(SME) is defined: 

SME = t y4 x PSE (13.54) 

where y = (1 + .95" m )/2. For some values of m we have 


m 

t.975,d 


7 

3.76 

9.01 

15 

2.57 

5.22 

31 

2.22 

4.22 

63 

2.08 

3.91 


The procedure proposed by Lenth (1989) then is as follows: 

1. Construct a bar chart of the signed effects. 

2. Add reference lines at 士 ME [from (13.53)] and 士 SME [from (13.54)]. 

3. Judge an effect as clearly active if E ai extends beyond the SME lines, 
inactive if E ai does not extend beyond the ME lines, and of debatable 
significance if E ai falls between the ME and SME lines. 
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Figure 13.3 Bar chart for Example 13.14. 


We illustrate this procedure in the following example. 

Example 13.14 Consider the data from Example 13.13. We obtain sq = 
1.5 x 5.1875 = 7.78, PSE = 1.5 x 4.9875 = 7.48, ME = 19.23, SME = 39.05. 
The bar chart is given in Figure 13.3. From it we see that only D is significant 
at a = .05. This agrees with the result in Example 13.13. □ 


13.9.3 Extension to Nonorthogonal Design 

The previous analysis procedures are applicable for orthogonal designs. There 
are, however, situations where we may encounter nonorthogonal designs, either 
by construction (see Section 14.4) or as a consequence of some mishap in the 
experiment. For example, we may have some replicates for one or a few treatment 
combinations. Or we may have missing values. Both cases lead to nonorthogonal 
designs, complicating the analysis and interpretation of results. 

A data-driven method for nonorthogonal designs was proposed by Kunert 
(1997), extending the method of Daniel (1959) as modified by Zahn (1975) and 
the method of Lenth (1989). For details we refer the reader to the original article 
by Kunert (1997). 
























CHAPTER 14 


Main Effect Plans 


14.1 INTRODUCTION 

Main effect plans (MEP) or resolution III designs are commonly used in 
exploratory studies when a large number of factors need to be considered or 
screened. The idea is to try to detect factors that exhibit “large” main effects 
and discard factors with no noticeable effects from further study. The crucial 
assumption here is that all interactions are negligible, including 2-factor inter¬ 
actions. This may not always be realistic, but as a first approximation this is 
nevertheless a very valuable method. In some cases it may be helpful to know 
with which 2-factor interactions each main effect is aliased. We shall make some 
remarks about that later. 


14.2 ORTHOGONAL RESOLUTION III DESIGNS FOR 
SYMMETRICAL FACTORIALS 

Among the resolution III designs those that admit uncorrelated estimates of main 
effects are of major importance. Not only are they easy to construct and analyze, 
but the results from such experiments are also easy to interpret. Because of their 
special feature, these designs are referred to as orthogonal resolution III designs 
or orthogonal main effect plans (OMEP). We shall now describe some methods 
of constructing OMEPs. 

14.2.1 Fisher Plans 

In discussing systems of confounding (see Section 11.4) for an s n factorial in 
blocks of size with n = (s^ — 1)/(5 — 1) we have already provided a method 
of constructing a l/s n ~^ fraction of an s n factorial. Such a FFD is a saturated 
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OMEP, designated by indicating that this is a l/s n ~^ fraction of an 

s n factorial resulting in s n _( n _0 = 〆 treatment combinations (runs, trials) and 
is of resolution III. Here s = p m and p is a prime. The method of construc¬ 
tion is based on results given by Fisher (1942, 1945) on how to obtain the 
IBSG for an s n factorial in blocks of size s^. The IBSG is then the OMEP. 
The method of construction guarantees that each level of each factor occurs 
the same number of times, namely and each level combination for any 

two factors occurs the same number of times, namely s^~ 2 (see Section 11.3). 
The latter is a sufficient condition for the estimates of the main effects to be 
uncorrelated. 

To see this, consider model (13.28) with X\ having the following properties: 

1. Each column has s^~ l unity elements, the remaining / 一 1 ^? — 1) elements 
are zero. 

2. The inner product of any two columns belonging to the same factor [i.e .， 
columns belonging to (A /)5 and (A/) y , 8, ) / = 0, 1,... ， s — 1; 8 ^ y] is 
equal to zero. 

3. The inner product of any two columns belonging to different factors [i.e., 
columns belonging to (A/)^ and (A；) y , i, j = 1, 2, ..n; / / j] is equal 
to s e ~ 2 . 


Considering (13.28) as a two-part linear model and using the result of 
Section 1.4.8 we can write the RNE for E\ as 



I - —^')XxEx = X\ 



(14.1) 


The coefficient matrix in (14.1) can be written alternatively as 

= ^ 1^1 

with 




(14.2) 


that is, X 1 is obtained from X \ by subtracting from each element the correspond¬ 
ing column mean. Because of property 1 each column mean is l/s. Because of 
properties 2 and 3, it follows then that 


x\Xi = diag 


n 
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with M an ^ x 5- matrix of the form 

M = s e ~ 2 (sl - 00 f ) 

Using the result from Theorem 1.1 we then have 

= diag(Af- ， M~, ... ， M~) (14.3) 

with 



Since (^x\ X \ ^ is the variance - covariance matrix for estimable functions 
associated with the various main effects, it follows then from (14.3) that 
estimable functions associated with different main effects, such as cs(Ai)s 
and for the two factors A/ and Aj, are uncorrelated. The same, of 

course, is true for the estimates of orthogonal contrasts associated with the main 
effects of the same factor, such as and with Q 办 = 0. 

The estimates of the estimable functions are obtained from (14.1) as 

E l = (x^X^X^y 

Examples of OMEPs that can be obtained by this method are given in 
Table 14.1. In this table n represents the maximum number of factors that can 
be accommodated in N = s l runs resulting in a saturated OMEP. It should 
be clear how this method can also be used to construct OMEPs for n* < n = 
(〆 一 1)/(^ — 1) factors in N runs. All we need to do is to construct the plan for 
n factors and then delete n — n* factors from that plan. The resulting design is 
obviously an OMEP, but it is no longer a saturated OMEP. Instead it provides 
(n — n*)(s — 1) d.f. for error. 


Table 14.1 Orthogonal Main 
Effect Plans (Fisher’s Method) 



N = 

2 7 

8 

2 15 

16 

2 31 

32 

3 4 

9 

3 13 

27 

3 40 

81 

4 5 

16 

4 21 

64 

5 6 

25 
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14.2.2 Collapsing Factor Levels 

The condition of equal frequencies for all level combinations for any pair of 
factors in an OMEP can be relaxed to that of proportional frequencies. This 
was shown first by Plackett (1946) and later by Addelman and Kempthome 
(1961) for asymmetrical factorials (see Theorem 14.1). This result leads to the 
construction of additional OMEPs by using what is called collapsing of factor 
levels. For example, we may collapse the levels of a three-level factor to those 
of a two-level factor by setting up the correspondence 

Three-Level Factor Two-Level Factor 

0 — 0 

1 — ^ 1 

2 — ^ 0 

A 一 9 

Using this correspondence we can, for example, use the 3 n J and obtain an 
OMEP for the 2 4 factorial with nine runs. 

14.2.3 Alias Structure 

Even though for resolution III designs we make the assumption that all interac¬ 
tions are negligible, it may in certain situations be useful to know with which 
2-factor interactions the main effects are confounded. The reason, of course, is 
that among interactions the 2-factor interactions are least likely to be negligible 
and hence to know the aliasing between main effects and 2-factor interactions may 
be helpful for diagnostic purposes (see also Section 13.6.4). We shall illustrate 
this later. 

The general method of investigating the alias structure is, of course, to make 
use of models (13.28) and (13.33). That is, we estimate the main effects from 
a given design using (13.31) and then consider the expected value of the esti¬ 
mates under model (13.33). To illustrate this we have from (13.31), with 
EX = (M, E[), 

E\ = (X^X\)~XXy 

and, using (13.33) 

E[X\E\] = X\(XX x\)~ XX {X\E\ + x 2 e 2 ) 

=X\E\ + X* l (X* l , X* l rX* l , X 2 E 2 

or, for an estimable function 

E[c'E\] = c'E\ + c , (X* l f X* l )-X* l , X 2 E 2 (14.4) 
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Expressions for this alias structure can be stated explicitly (see below) for certain 
OMEPs. In other cases computer programs may be useful to obtain expressions 
for (14.4). ‘ 

Since the OMEP s n ~^ n ~^ with n = (s^ — 1)/(^ — 1) is equivalent to the IBSG 
of an s n experiment in blocks of size s^, it is obvious that the interactions that 
are confounded with blocks are also the interactions included in the identity 
relationship. Altogether there are (s n ~^ — l)/(s — 1) interactions in the identity 
relationship, n — i independent interactions plus all their GIs. For our purpose 
here we are concerned only with knowing the 3-factor interactions contained in 
the identity relationship. Some 3-factor interactions are directly identified from 

(11.27) , namely those E y j for which the partition a ㈣ corresponds to a 2-factor 

interaction in the s l system. Others may be obtained as GIs among the n — l 
independent interactions (11.27). Now if, for example, is a 3-factor 

interaction in the identity relationship for some i\, h with 1 < ii, h < n 
and i\, h different and some 8 , y with 1 < 8 , y < s — l, then will be con¬ 
founded with (or an appropriate power of to bring it into standard 

form), and so forth. An alternative method, using existing computer programs, 
of obtaining the relevant alias structure is given below. We now illustrate our 
discussion with the following example. 

Example 14J Suppose s = 3, l = 2, and hence n = A. The 3^J 2 was given 
as the IBSG in Example 11.2. From Example 11.4 it follows that the identity 
relationship is given by 

I = Fi F 2 F^ = Fi F 2 2 F 4 2 = Fi F 3 F 4 ^ F 2 F 3 F^ (14.5) 

or, more precisely, 

I = (FiF 2 F^ 0 = (Fi 柃柃 ) 0 = ( 巧 F 3 F 4 ) 0 = (F 2 F 3 F^) 0 (14.6) 

where F 1 F 2 F 3 and FiF^F^ are the independent interactions obtained from 

(11.27) and F 1 F 3 F 4 , F 2 F 3 F 4 are their GIs. It follows then from (14.5) that 
the relevant aliasing for F\, for example, is given by 

F\ = F 2 F 2 = F 2 F 4 = F 3 F 4 

If the 2-factor interactions were indeed not negligible, then instead of estimating, 
for example, (FO 2 — (Fi)o we would actually estimate [see also (14.4)] 

m ) 2 - (Fi)o] + [(F 2 F 3 2 )i - (F 2 F 3 2 )o] 

+ [(^2^4)2 - (^2^4)0] + [(^3^4)1 -( 巧 &) 0 ] (14.7) 


making use of (14.6). Clearly, if all 2-factor interactions were nonnegligible 
using the OMEP would be useless. But suppose the investigator is fairly sure 
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that all 2-factor interactions except possibly F 2 x F 3 are negligible. Then we 
would have to be concerned about the confounding of F\ and F 2 F 3 and F 4 
and F 2 F 3 . If then an estimated contrast belonging to F\ were “large,” we may 
suspect that this is partly due to a corresponding contrast belonging to FiF .， and 
so forth. Of course, we have no way of knowing, but this may suggest at least 
further investigation, for example, by augmenting the original design to achieve 
the desired dealiasing. It is in this sense that knowledge of the alias structure can 
be helpful as a diagnostic tool. 

The form of the estimable function (14.7) can also be obtained by using the 
E-option in SAS PROC GLM. The (fictitious) data and the ensuing analysis are 
given in Table 14.2. 

Concerning the SAS input and output in Table 14.2 we make the following 
comments: 

1. The factors are now labeled A, B, C, D. 

2. The model includes all 2-factor interactions similar to the model for a reso¬ 
lution IV design. 

3. The option e\ lists all type I estimable functions (see comment 4) and pro¬ 
duces the type I sums of squares (see 1.4.11.1). 

4. Using a model similar to (13.23) (see also Section 7.2.1), we see that by 
putting L2 = 1 and all other Li = 0( / = 3, 5, 6 , 8 , 9 ， 11 ， 12) the following 
function is estimable: 

- ^2 + 士 [(_)00 + ( a^)oi + «)02 — (^)20 - O 卢 )21 _ (_) 22 ] 

+ |[Oy)oo + («y)oi + («y)o2 - («y)2o - Oy)2i - Oy)22] 

+ ^[(^<5)00 + (p^)01 + («5 )o 2 — (Qf5)20 — 05)21 _ (0^)22] 

+ ^[(^y)oo - (^y)02 -( 卽 )10 + (^K)ll -( 卢 y)21 + (Py)22] 

+ I [ 0^)00 — ( P ^)02 — ( 抑 )11 + (/ 35 ) i 2 —⑽ )20 + ( 抑 ) 21 ] 

+ 去 [(y <5)oo — (k^)oi — (k<5)io + (yS)n + — (y<% 2 ] (14.8) 

Defining the interaction terms such that, for example, 

2 

〉 ： 《)" = 0 for / = 0 , 1 , 2 
7=0 

then in (14.8) all terms in (ay)ik, (oi8)u drop out. Concerning the remain¬ 

ing interaction terms, we make use of the two equivalent parameterizations, for 
example, 


B xC = BC-\-BC 2 
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Table 14.2 Orthogonal Main Effect Plan 3 4 一 2 

options nodate pageno=l ； 
data OMEP ； 
input A B C D Y @@; 
datalines ； 

0 0 0 0 5 1 1 2 0 7 2 2 1 0 6 

0 1 1 2 8 1 0 1 1 4 0 2 2 1 5 

1 2 0 2 8 2 0 2 2 9 2 1 0 1 7 

run ； 

proc print data=OMEP ； 
titlel 'TABLE 14.2'; 

title2 'ORTHOGONAL MAIN EFFECT PLAN 3** (4-2)'; 
run ； 

proc glm data=OMEP ； 
class ABC D ； 

model y=A BCD A*B A*C A*D B*C B*D C*D/el; 
title3 'ANALYSIS AND ESTIMABLE FUNCTIONS'; 
run ； 

Obs A B C D Y 

1 0 0 0 0 5 

2 112 0 7 

3 2 2 1 0 6 

4 0 112 8 

5 10 114 

6 0 2 2 1 5 

7 1 2 0 2 8 

8 2 0 2 2 9 

9 2 10 17 

The GLM Procedure 

ANALYSIS AND ESTIMABLE FUNCTIONS 

Class Level Information 

Class Levels Values 

A 3 0 12 

B 3 0 12 

C 3 0 12 


D 


3 


0 12 
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Table 14.2 (Continued) 





Number 

of observations 9 





Type I 

Estimable Functions 






-----Coefficients 一 —— 


Effect 



A 

B 

C 

Intercept 

0 

0 

0 

A 

0 


L2 

0 

0 

A 

1 


L3 

0 

0 

A 

2 


-L2-L3 

0 

0 

B 

0 


0 

L5 

0 

B 

1 


0 

L6 

0 

B 

2 


0 

-L5-L6 

0 

C 

0 


0 

0 

L8 

C 

1 


0 

0 

L9 

C 

2 


0 

0 

-L8-L9 

D 

0 


0 

0 

0 

D 

1 


0 

0 

0 

D 

2 


0 

0 

0 

A*B 

0 

0 

0.3333*L2 

0.3333*L5 

0.3333*L8 

A*B 

0 

1 

0.3333*L2 

0.3333*L6 

0.3333*L9 

A*B 

0 

2 

0.3333*L2 

-0.3333*L5-0.3333*L6 

-0.3333*L8-0.3333*L9 

A*B 

1 

0 

0.3333*L3 

0.3333*L5 

0.3333*L9 

A*B 

1 

1 

0.3333*L3 

0.3333*L6 

-0.3333*L8-0.3333*L9 

A*B 

1 

2 

0.3333*L3 

-0.3333*L5-0.3333*L6 

0.3333*L8 

A*B 

2 

0 

-0.3333*L2-0.3333*L3 

0.3333*L5 

-0.3333*L8-0.3333*L9 

A*B 

2 

1 

-0.3333*L2-0.3333*L3 

0.3333*L6 

0.3333*L8 

A*B 

2 

2 

-0.3333*L2-0.3333*L3 

-0.3333*L5-0.3333*L6 

0.3333*L9 

A*C 

0 

0 

0.3333*L2 

0.3333*L5 

0.3333*L8 

A*C 

0 

1 

0.3333*L2 

0.3333*L6 

0.3333*L9 

A*C 

0 

2 

0.3333*L2 

-0.3333*L5-0.3333*L6 

-0.3333*L8-0.3333*L9 

A*C 

1 

0 

0.3333*L3 

-0.3333*L5-0.3333*L6 

0.3333*L8 

A*C 

1 

1 

0.3333*L3 

0.3333*L5 

0.3333*L9 

A*C 

1 

2 

0.3333*L3 

0.3333*L6 

-0.3333*L8-0.3333*L9 

A*C 

2 

0 

-0.3333*L2-0.3333*L3 

0.3333*L6 

0.3333*L8 

A*C 

2 

1 

-0.3333*L2-0.3333*L3 

-0.3333*L5-0.3333*L6 

0.3333*L9 

A*C 

2 

2 

-0.3333*L2-0.3333*L3 

0.3333*L5 

-0.3333*L8-0.3333*L9 

A*D 

0 

0 

0.3333*L2 

0.3333*L5 

0.3333*L8 

A*D 

0 

1 

0.3333*L2 

-0.3333*L5-0.3333*L6 

-0.3333*L8-0.3333*L9 

A*D 

0 

2 

0.3333*L2 

0,3333*L6 

0.3333*L9 

A*D 

1 

0 

0.3333*L3 

0.3333*L6 

-0.3333*L8-0.3333*L9 

A*D 

1 

1 

0.3333*L3 

0.3333*L5 

0.3333*L9 
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Table 14.2 (Continued) 


Type I Estimable Functions 


Effect 

Intercept 

A 0 

A 1 

A 2 

B 0 

B 1 

B 2 

C 0 

C 1 

C 2 

D 0 

D 1 

D 2 

A*B 0 0 

A*B 0 1 

A*B 0 2 

A*B 1 0 

A*B 1 1 

A*B 12 

A*B 2 0 

A*B 2 1 

A*B 2 2 

A*C 0 0 

A*C 0 1 

A*C 0 2 

A*C 1 0 

A*C 1 1 

A*C 12 

A*C 2 0 

A*C 2 1 

A*C 2 2 

A*D 0 0 

A*D 0 1 

A*D 0 2 

A*D 10 

A*D 11 


Coefficients ---- 
A*B A*C A*D 


Lll 

L12 

-L11-L12 


0.3333*L11 000 
-0.3333*Lll-0.3333*L12 000 
0.3333*L12 000 
0.3333*L12 000 
0.3333*L11 000 
-0.3333*Lll-0.3333*L12 000 
-0.3333*Lll-0.3333*L12 000 
0.3333*L12 000 
0.3333*L11 000 


0.3333*L11 0 
-0.3333*Lll-0.3333*L12 0 
0.3333*L12 0 
-0.3333*Lll-0.3333*L12 0 
0.3333*L12 0 
0.3333*L11 0 
0.3333*L12 0 
0.3333*L11 0 
-0.3333*Lll-0.3333*L12 0 


0.3333*L11 0 
0.3333*L12 0 
-0.3333*Lll-0.3333*L12 0 
0.3333*L11 0 
0.3333*L12 0 


B*C 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 


B*D 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 


C*D 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 
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Table 14.2 (Continued) 





Type I 

Estimable Functions 







-----Coefficients ---- 



Effect 



A 

B 

C 


A*D 

1 

2 

0.3333*L3 

-0.3333*L5-0.3333*L6 

0.3333*L8 


A*D 

2 

0 

-0.3333*L2-0.3333*L3 

-0.3333*L5-0.3333*L6 

0.3333*L9 


A*D 

2 

1 

-0.3333*L2-0.3333*L3 

0.3333*L6 

0.3333*L8 


A*D 

2 

2 

-0.3333*L2-0.3333*L3 

0.3333*L5 

-0.3333*L8-0.3333 

*L9 

B*C 

0 

0 

0.3333*L2 

0.3333*L5 

0.3333*L8 


B*C 

0 

1 

0.3333*L3 

0.3333*L5 

0.3333*L9 


B*C 

0 

2 

-0.3333*L2-0.3333*L3 

0.3333*L5 

-0.3333*L8-0.3333 

*L9 

B*C 

1 

0 

-0.3333*L2-0.3333*L3 

0.3333*L6 

0.3333*L8 


B*C 

1 

1 

0.3333*L2 

0.3333*L6 

0.3333*L9 


B*C 

1 

2 

0.3333*L3 

0.3333*L6 

-0.3333*L8-0.3333 

*L9 

B*C 

2 

0 

0.3333*L3 

-0.3333*L5-0.3333*L6 

0.3333*L8 


B*C 

2 

1 

-0.3333*L2-0.3333*L3 

-0.3333*L5-0.3333*L6 

0.3333*L9 


B*C 

2 

2 

0.3333*L2 

-0.3333*L5-0.3333*L6 

-0.3333*L8-0.3333 

*L9 

B*D 

0 

0 

0.3333*L2 

0.3333*L5 

0.3333*L8 


B*D 

0 

1 

0.3333*L3 

0.3333*L5 

0.3333*L9 


B*D 

0 

2 

-0.3333*L2-0.3333*L3 

0.3333*L5 

-0.3333*L8-0.3333 

*L9 

B*D 

1 

0 

0.3333*L3 

0.3333*L6 

-0.3333*L8-0.3333 

*L9 

B*D 

1 

1 

-0.3333*L2-0.3333*L3 

0.3333*L6 

0.3333*L8 


B*D 

1 

2 

0.3333*L2 

0.3333*L6 

0.3333*L9 


B*D 

2 

0 

-0.3333*L2-0.3333*L3 

-0.3333*L5-0.3333*L6 

0.3333*L9 


B*D 

2 

1 

0.3333*L2 

-0.3333*L5-0.3333*L6 

-0.3333*L8-0.3333*L9 

B*D 

2 

2 

0.3333*L3 

-0.3333*L5-0.3333*L6 

0.3333*L8 


C*D 

0 

0 

0.3333*L2 

0.3333*L5 

0.3333*L8 


C*D 

0 

1 

-0.3333*L2-0.3333*L3 

0.3333*L6 

0.3333*L8 


C*D 

0 

2 

0.3333*L3 

-0.3333*L5-0.3333*L6 

0.3333*L8 


C*D 

1 

0 

-0.3333*L2-0.3333*L3 

-0.3333*L5-0.3333*L6 

0.3333*L9 


C*D 

1 

1 

0.3333*L3 

0.3333*L5 

0.3333*L9 


C*D 

1 

2 

0.3333*L2 

0.3333*L6 

0.3333*L9 


C*D 

2 

0 

0.3333*L3 

0.3333*L6 

-0.3333*L8-0.3333 

*L9 

C*D 

2 

1 

0.3333*L2 

-0.3333*L5-0.3333*L6 

-0.3333*L8-0.3333 

*L9 

C*D 

2 

2 

-0.3333*L2-0.3333*L3 

0.3333*L5 

-0.3333*L8-0.3333 

*L9 




Type I 

Estimable Functions 







-----Coefficients ---- 


- 

Effect 



D 

A*B A*C A*D 

B*C B*D 

C*D 

A*D 

1 

2 

-0.3333*Lll-0.3333*L12 000 

0 0 

0 

A*D 

2 

0 

0.3333*L11 

0 0 0 

0 0 

0 

A*D 

2 

1 

0.3333*L12 

0 0 0 

0 0 

0 

A*D 

2 

2 

-0.3333*Lll-0,3333*L12 000 

0 0 

0 
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Table 14.2 (Continued) 


B*C 

0 

0 

0.3333*L11 

0 

0 

0 

0 

0 


0 

B*C 

0 

1 

0.3333*L12 

0 

0 

0 

0 

0 


0 

B*C 

0 

2 

-0.3333*Lll-0.3333*L12 

0 

0 

0 

0 

0 


0 

B*C 

1 

0 

0.3333*L12 

0 

0 

0 

0 

0 


0 

B*C 

1 

1 

-0.3333*Lll-0.3333*L12 

0 

0 

0 

0 

0 


0 

B*C 

1 

2 

0.3333*L11 

0 

0 

0 

0 

0 


0 

B*C 

2 

0 

-0.3333*Lll-0.3333*L12 

0 

0 

0 

0 

0 


0 

B*C 

2 

1 

0.3333*L11 

0 

0 

0 

0 

0 


0 

B*C 

2 

2 

0.3333*L12 

0 

0 

0 

0 

0 


0 

B*D 

0 

0 

0.3333*L11 

0 

0 

0 

0 

0 


0 

B*D 

0 

1 

0.3333*L12 

0 

0 

0 

0 

0 


0 

B*D 

0 

2 

-0.3333*Lll-0.3333*L12 

0 

0 

0 

0 

0 


0 

B*D 

1 

0 

0.3333*L11 

0 

0 

0 

0 

0 


0 

B*D 

1 

1 

0.3333*L12 

0 

0 

0 

0 

0 


0 

B*D 

1 

2 

-0.3333*Lll-0.3333*L12 

0 

0 

0 

0 

0 


0 

B*D 

2 

0 

0.3333*L11 

0 

0 

0 

0 

0 


0 

B*D 

2 

1 

0.3333*L12 

0 

0 

0 

0 

0 


0 

B*D 

2 

2 

-0.3333*Lll-0.3333*L12 

0 

0 

0 

0 

0 


0 

C*D 

0 

0 

0.3333*L11 

0 

0 

0 

0 

0 


0 

C*D 

0 

1 

0.3333*L12 

0 

0 

0 

0 

0 


0 

C*D 

0 

2 

-0.3333*Lll-0.3333*L12 

0 

0 

0 

0 

0 


0 

C*D 

1 

0 

0.3333*L11 

0 

0 

0 

0 

0 


0 

C*D 

1 

1 

0.3333*L12 

0 

0 

0 

0 

0 


0 

C*D 

1 

2 

-0.3333*Lll-0.3333*L12 

0 

0 

0 

0 

0 


0 

C*D 

2 

0 

0.3333*L11 

0 

0 

0 

0 

0 


0 

C*D 

2 

1 

0.3333*L12 

0 

0 

0 

0 

0 


0 

C*D 

2 

2 

-0.3333*Lll-0.3333*L12 

0 

0 

0 

0 

0 


0 

Dependent 

Variable : Y 











Sum of 








Source 



DF Squares 


Mean 

Square 

F Value 


Pr 

> F 

Model 



8 22.22222222 


2.77777778 





Error 



0 0.00000000 








Corrected 

Total 8 22.22222222 











R-Square Coeff Var 


Root MSE 

Y Mean 







1.000000 . 



. 6. 

.555556 




Source 



DF Type I SS 


Mean 

Square 

F Value 


Pr 

> F 

A 



2 2.88888889 


1.44444444 





B 



2 2.88888889 


1.44444444 
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Table 14.2 (Continued) 


c 

2 

1.55555556 

0.77777778 


D 

2 

14.88888889 

7.44444444 


A*B 

0 

0.00000000 



A*C 

0 

0.00000000 



A*D 

0 

0.00000000 



B*C 

0 

0.00000000 



B*D 

0 

0.00000000 



C*D 

0 

0.00000000 




(see Section 10.2) where the 2 d.f. of BC consist of comparisons among BCq, 
BCi, and BC 2 , with 


BC 0 ： ( 仰 )00 + (Pr)n + ( 卽 )21 
BCi ： (^y)io + (Py)oi + (Py)22 
BC2 ： (Py)io + (Pv)o2 + (^y)n 

and similarly 

BCq ： 03 y)oo + ( 外 )11 + ( 卽 )22 
BC\\ 03 y)io +(~)02 +( 卢 K)2i 
03y)2o + (Py)oi + (0y)i2 


We can write out similar expressions involving the (P8)jk and (y8)kt terms. 
Utilizing these equivalencies in (14.8) leads to the same result as given in (14.7). 

^ i □ 


14.2.4 Plackett - B urman Designs 

For 2 n factorials a special method of constructing a large number of OMEPs was 
provided by Plackett and B urman (1946). The method is based on the existence 
of Hadamard matrices. 


Definition 14.1 A square matrix H of order N with entries —1, 1 is called 
a Hadamard matrix of order N if 

HH f = HH = NI n (14.9) 

□ 

Condition (14.9) is equivalent to saying that for a Hadamard matrix H the 
rows are pairwise orthogonal and the columns are pairwise orthogonal. Given a 
Hadamard matrix H, an equivalent matrix with all elements in the first column 
equal to +1 can be obtained by multiplying by —1 each element in every row 









576 


MAIN EFFECT PLANS 


of H whose first element is —1. Such a matrix will be called seminormalized. 
Through a similar operation one can transform a Hadamard matrix H such that 
all elements in the first row are equal to +1. If the first column and first row 
contain only +1 elements, then the matrix is said to be normalized. Except for 
the trivial Hadamard matrices 

H l = (l),H 2 = 

it is known that a necessary condition of the existence of a Hadamard matrix of 
order N is that N = 0 mod 4 (Paley, 1933). Hadamard matrices for N < 200 
have been constructed (see, e.g., Hedayat and Wallis, 1978; Hedayat, Sloane and 
Stufken, 1999). For N = 2 l , for example, H can be constructed as the i-fo\d 
Kronecker product of H 2 with itself. Hadamard matrices for N = At and t = l, 
2, ..8 are given by Hedayat and Wallis (1978) and for ，= 1 ， 2, …， 12 by 
Dey (1985). " 

Consider now a Hadamard matrix H of order N in normalized (or seminor¬ 
malized) form and delete from it the first column. Denote the resulting matrix 
by H and identify the columns with factors F\, F 2 , …, F^-\. We then take 

^* = 1(5 + 0J f ) 

and consider each row of H* as a treatment combination from the 2 N ~ l facto¬ 
rial. These N runs constitute an OMEP for — 1 factors, with N = 4t(t = 1, 
2, .. .)• The fact that this FFD is an OMEP follows immediately from the prop¬ 
erties of a Hadamard matrix in seminormalized (and hence normalized) form, 
namely for any column (factor) we have the same number of +1 and —1 ele¬ 
ments in H (and hence the same number of +1 and 0 elements in H^, and for 
any pair of columns we have the same number of (+ 1 , + 1 ) ，（ + 1 , — 1 )，（一 1 ， 
+ 1) ，（一 1 ，一 1) combinations in H). FFDs constructed in this manner are called 
Plackett-Burman designs. Actually, Plackett and Burman (1946) list designs for 
t = l, 2, … ， 25 (except23 which was given by Baumert, Golomb, and Hall, 
1962) using //* = + 33’). In this way the OMEP includes the control, 

that is, the treatment combination (0, 0, ..0). We note here that for N = the 
Plackett-Burman designs are the same as the Fisher plans forn = N— 1 factors 
discussed earlier. Also, the Plackett-Burman designs are relevant for the prob¬ 
lem of weighing N — \ light objects in N weighings, a problem first discussed 
by Yates (1935) for a spring balance and by Hotelling (1944) for a chemical bal¬ 
ance. Hotelling (1944) showed that the optimum weighing design was obtained 
by using Hadamard matrices. Kempthome (1948) discussed the problem from 
the viewpoint of factorial experimentation and showed how fractional factorials 
could be used. For a detailed discussion of weighing designs the reader is referred 
to Banerjee (1975). 

In contrast to the Fisher plans the alias structure for the Plackett-Burman 
designs is rather complicated and difficult to determine (Margolin, 1968; Lin and 
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Draper, 1993). In any case, each main effect is aliased with a long string of 
2-factor interactions and no expressions like (14.7) can be written out easily and 
studied in case the estimate of a main effect turns out to be “large.” 

Of course, folding-over the Plackett-Burman design leads to a resolution IV 
design, thereby dealiasing the main effects. In many cases less drastic measures 
may, however, achieve the goal of dealiasing some main effects, and that may be 
all that is needed since main effect plans are often used for screening purposes and 
only some of the main effects may be important; that is, only some of the factors 
may be active. This leads to investigating the projections of a Plackett-Burman 
design into smaller, say k, dimensions, where k < n. Here projection simply 
means to ignore the remaining n — k factors after k factors have been identified, 
at least in a rough and preliminary sense, as being active. For example, Lin and 
Draper (1992) have shown that for the 2 11 Plackett-Burman design in 12 runs 
the projection io k = A dimensions shows that one needs to add only one more 
run to identify a 2jy 1 design among the 13 runs. Or for A: = 5 one needs to add 
8 more runs to obtain a 2jy 1 design plus additional runs. Similar results have 
been obtained for other Plackett-Burman designs (Lin and Draper, 1993). 

We shall use the 2 11 Plackett-Burman design in 12 runs, as given in 
Table 14.3，to make some additional comments about projections and the esti¬ 
mation of main effects and 2 -factor interactions: 

1. k = 2: Since the Plackett-Burman designs are orthogonal arrays of strength 
2 (see Section 14.2.5)，it follows that each combination of the levels of two 
factors occurs equally often. Hence, for the two specified factors we can 
estimate the main effects and the 2 -factor interaction. 

2. k = 3: Inspection of the design in Table 14.3 shows that it contains a com¬ 
plete 2 3 factorial, allowing the estimation of the three main effects and three 
2-factor interactions. The 12 runs no longer constitute an orthogonal design 
and hence the estimators will be correlated. 

3. k = 4: It turns out that the available 11 d.f. allow the estimation of the four 
main effects and their six 2-factor interactions. This is shown in Table 14.3 
using the E-option in SAS PROC GLM (keeping in mind that, e.g., EFo\ + 
EJq\ + ELoi = 0). The ANOVA in Table 14.3 also shows that the design is 
no longer orthogonal as indicated by different numerical values for the type 
I and type III sums of squares. 


14.2.5 Other Methods 

We mention here briefly two other methods of constructing OMEPs. Addelman 
and Kempthorne (1961) describe a procedure for constructing an OMEP for an 
s n factorial in 2s l runs, where n = [2(〆 一 \)/{s — 1) — 1]. This procedure is an 
extension of the method used for the Fisher plans, using, however, a different 
correspondence between the factors and factor combinations from the 2 l factorial 
and the n factors for the FFD. This method leads to OMEPs such as TV = 18 = 
2 . 3 2 runs for the 3 7 , = 54 = 2 • 3 3 runs for the 3 25 , or TV = 32 = 2 .4 2 runs 

for the 4 9 factorial. 
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Table 14.3 Plackett - Burman Design for 11 Factors in 12 Runs 


options pageno=l nodate； 
data PBdesign； 

input ABCDEFGHJKLY; 
datalines； 

11011100010 10 
01101110001 15 
10110111000 20 
01011011100 25 
00101101110 23 
00010110111 32 
10001011011 29 
11000101101 33 
11100010110 30 
01110001011 31 
10111000101 28 
00000000000 30 
/ 

run ； 


proc print data=PBdesign; 

titlel 'TABLE 14.3'; 

title2 'PLACKETT-BURMAN DESIGN'; 

title3 'FOR 11 FACTORS IN 12 RUNS'; 

run； 

proc glm data=PBdesign; 

Class ABCDEFGHJKL; 
model Y=A BCDEFGHJKL; 
title4 'ANALYSIS OF VARIANCE 1 ; 
run； 


proc glm data=PBdesign; 
class E F J L； 
model Y=E|F|j|L@2/e ； 

title3 'PROJECTION ON FACTORS E,F,J,L' ; 

title4 'ANALYSIS WITH MAIN EFFECTS AND TWO-FACTOR INTERACTIONS 1 ; 


run； 

Obs ABCDEFGHJKL Y 


1110 1 

2 0 110 

3 10 11 

4 0 10 1 

5 0 0 1 0 

6 0 0 0 1 

7 10 0 0 

8 110 0 

9 1110 

10 0 1 1 1 

11 1 0 1 1 

12 0 0 0 0 


1 

1 

0 

1 

1 

0 

1 

0 

0 

0 

1 

0 


1 0 0 0 1 
110 0 0 
1110 0 
0 1110 
10 111 
110 11 
0 110 1 
10 110 
0 10 11 
0 0 10 1 
0 0 0 1 0 
0 0 0 0 0 


0 10 
1 15 

0 20 
0 25 

0 23 

1 32 

1 29 

1 33 

0 30 

1 31 

1 28 
0 30 
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Table 14.3 {Continued) 


Class Level Information 


Class Levels Values 


A 

B 

C 

D 

E 

F 

G 

H 

J 

K 

L 


2 0 1 
2 0 1 
2 0 1 
2 0 1 
2 0 1 
2 0 1 
2 0 1 
2 0 1 
2 0 1 
2 0 1 
2 0 1 


Number of observations 12 


Dependent Variable : Y 





Sum 

of 


Source 


DF 

Squares Mean Square 

F Value 

Model 


11 

575.0000000 52.2727273 


Error 


0 

0 . 0000000 . 


Corrected Total 

11 

575.0000000 



R-Square 

Coeff Var 

Root MSE 

Y Mean 


1.000000 



25.50000 

Source 

DF 

Type 工 SS 

Mean Square F 

Value 

A 

1 

3 

0000000 

3.0000000 


B 

1 

27 

0000000 

27.0000000 


C 

1 

12 

0000000 

12.0000000 


D 

1 

16 

3333333 

16.3333333 


E 

1 

176 

3333333 

176.3333333 


F 

1 

133 

3333333 

133.3333333 


G 

1 

1 

3333333 

1.3333333 


H 

1 

21 

3333333 

21.3333333 


J 

1 

108 

0000000 

108.0000000 


K 

1 

1 

3333333 

1.3333333 


L 

1 

75 

0000000 

75.0000000 



Pr > F 


Pr > F 
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MAIN EFFECT PLANS 

Source 

DF 

Type 工 II SS 

Mean Square 

F Value 

Pr > F 

A 

1 

3.0000000 

3.0000000 



B 

1 

27.0000000 

27.0000000 



C 

1 

12.0000000 

12.0000000 



D 

1 

16.3333333 

16.3333333 



E 

1 

176.3333333 

176.3333333 



F 

1 

133.3333333 

133.3333333 



G 

1 

1.3333333 

1.3333333 



H 

1 

21.3333333 

21.3333333 



J 

1 

108.0000000 

108.0000000 



K 

1 

1.3333333 

1.3333333 



L 

1 

75.0000000 

75.0000000 





PROJECTION ON 

FACTORS E, F, 

J, L 



ANALYSIS WITH MAIN EFFECTS AND TWO-FACTOR INTERACTIONS 

Class Level Information 

Class Levels Values 

E 2 0 1 

F 2 0 1 

J 2 0 1 

L 2 0 1 

Number of observations 12 


General Form of Estimable Functions 


Effect 



Coefficients 

Intercept 



LI 

E 

0 


L2 

E 

1 


L1-L2 

F 

0 


L4 

F 

1 


L1-L4 

E*F 

0 

0 

L6 

E*F 

0 

1 

L2-L6 

E*F 

1 

0 

L4-L6 

E*F 

1 

1 

L1-L2-L4+L6 

J 

0 


L10 

J 

1 


L1-L10 
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Table 14.3 (Continued) 


E*J 

0 

0 


L12 



E*J 

0 

1 


L2-L12 



E*J 

1 

0 


L10-L12 



E*J 

1 

1 


L1-L2-L10+L12 



F*J 

0 

0 


L16 



F*J 

0 

1 


L4-L16 



F*J 

1 

0 


L10-L16 



F*J 

1 

1 


L1-L4-L10+L16 



L 

0 



L20 



L 

1 



L1-L20 



E*L 

0 

0 


L22 



E*L 

0 

1 


L2-L22 



E*L 

1 

0 


L20-L22 



E*L 

1 

1 


L1-L2-L20+L22 



F*L 

0 

0 


L26 



F*L 

0 

1 


L4-L26 



F*L 

1 

0 


L20-L26 



F*L 

1 

1 


L1-L4-L20+L26 



J*L 

0 

0 


L30 



J*L 

0 

1 


L10-L30 



J*L 

1 

0 


L20-L30 



J*L 

1 

1 


L1-L10-L20+L30 


Dependent Variable : 

Y 







Sum 

of 




Source DF 

Squares 

Mean Square 

F Value 

Pr > F 

Model 10 

574.5000000 

57.4500000 

114.90 

0.0725 

Error 1 

0.5000000 

0.5000000 



Corrected Total 11 

575.0000000 




R-Square 

Coeff Var 


Root MSE 

Y Mean 


0.999130 

2.772968 


0.707107 

25.50000 


Source DF Type 工 SS 


Mean Square F 

Value 

Pr > F 

E 1 176. 

3333333 


176.3333333 

352.67 

0.0339 

F 1 133 . 

3333333 


133.3333333 

266.67 

0.0389 
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Table 14.3 (Continued) 


E*F 

1 

65.3333333 

65.3333333 

130.67 

0.0556 

J 

1 

66.6666667 

66.6666667 

133.33 

0.0550 

E*J 

1 

2.6666667 

2.6666667 

5.33 

0.2601 

F*J 

1 

112.6666667 

112.6666667 

225.33 

0.0423 

L 

1 

10.8000000 

10.8000000 

21.60 

0.1349 

E*L 

1 

5.4857143 

5.4857143 

10.97 

0.1867 

F*L 

1 

0.1758242 

0.1758242 

0.35 

0.6592 

J*L 

1 

1.0384615 

1.0384615 

2.08 

0.3862 


Source 

DF 

Type III SS 

Mean Square 

F Value 

Pr > F 

E 

1 

61.03846154 

61.03846154 

122.08 

0.0575 

F 

1 

61.03846154 

61.03846154 

122.08 

0.0575 

E*F 

1 

9.34615385 

9.34615385 

18.69 

0.1447 

J 

1 

61.03846154 

61.03846154 

122.08 

0.0575 

E*J 

1 

0.11538462 

0.11538462 

0.23 

0.7149 

F*J 

1 

72.11538462 

72.11538462 

144.23 

0.0529 

L 

1 

9.34615385 

9.34615385 

18.69 

0.1447 

E*L 

1 

5.65384615 

5.65384615 

11.31 

0.1840 

F*L 

1 

0.11538462 

0.11538462 

0.23 

0.7149 

J*L 

1 

1.03846154 

1.03846154 

2.08 

0.3862 


Another method is derived from the fact that each OMEP is an orthogonal 
array of strength 2 (Rao, 1946a). More precisely, if n denotes the number of 
factors, N the number of runs, 5 the number of levels, then an OMEP can be 
denoted by OA[A^, n, s, 2]. Bose and Bush (1952) show that an OA[A5 2 , Xs, s, 
2] can be constructed if 入 and s are both powers of the same prime p. Suppose 
入 = p u , s = p v . Then some examples of OMEPs obtained with this method are 
given below: 


Factorial Number of Runs (N) 


2 4 

2 8 

3 9 

3 27 


8 

16 

27 

81 


Obviously, these will not be saturated OMEPs. 


14.3 ORTHOGONAL RESOLUTION III DESIGNS FOR 
ASYMMETRICAL FACTORIALS 


In many practical situations it is the case that we shall use factors with different 
numbers of levels. This is dictated by the nature of the experiment and the 
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type of factors included in the experiment as well as the suspected nature of 
their response at different levels. And again, if we want to investigate a large 
number of factors, concentrating on estimating main effects may be a prudent 
thing to do. 

14.3.1 Kronecker Product Design 

An easy method to construct an OMEP for an s 1 ^ 1 x x ... x Sq q asymmetrical 
factorial is to consider each component symmetrical factorial separately, construct 
an OMEP for it, and then take the Kronecker product of these designs. For 
example, if we have a 2 7 x 3 4 we can construct a 2j^ 4 and a 3j^ 2 and the 
Kronecker product of these two designs will yield a FFD, which is an OMEP, 
with N = S - 9 = 72 runs. Obviously, this design contains far too many treatment 
combinations to account for the 7 + 4-2=15 d.f. for the main effects. It is for 
this reason that other methods of constructing OMEPs for asymmetrical factorials 
have been developed. 

14.3.2 Orthogonality Condition 

A method that yields a very large number of OMEPs for many different situa¬ 
tions was developed by Addelman and Kempthome (1961) (see also Addelman, 
1962a). These designs are based on OMEPs for symmetrical factorials with sub¬ 
sequent collapsing and replacement of levels for certain factors. This procedure 
will be described more precisely in Section 14.3.3. For now we mention only 
that as a consequence of this procedure the levels of one factor no longer occur 
together with the same frequency with each of the levels of any other factor, as 
was the case for the OMEPs for symmetrical factorials. They do, however, occur 
together with proportional frequencies and that, as we shall show, is sufficient to 
guarantee orthogonal estimates of main effects. 

To reduce the amount of writing let us consider the case of two factors, A 
and B say, where A has a levels and B has b levels. Suppose that in a given 
FFD the /th level of A occurs together with the jth level of B for Nij times. 
Let N ij = N i.y Hi N ij = N -j^ Hi N i. = E ； N J = the total number of 
runs in the FFD. If the level combinations occur with proportional frequencies, 
we have Nij = Ni,Nj/N for every i and j (i = 0, 1, ..a — l; 7=01,..., 
b — 1). We then state the following theorem. 

Theorem 14.1 A sufficient condition for a main effect plan for two factors 
A and B with a and b levels, respectively, to be an OMEP is that each level 
combination for the two factors occurs with proportional frequency. 

Proof Consider model (13.28) and rewrite it as 

//x\ 

E(y) = (0, Xn, X 12 ) A 

\b) 
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where A! = (Ao, A\,..., A a -\) and B r = (Bq, Bi, , Bb-\) represent the main 
effect components for the two factors A and B, respectively. We then rewrite the 
N E (13.29) as 



- J'3 

0 r X u 

o r x l2 - 




- 3'y - 



JSV 

X' n X n 

X' n x 12 


A 

= 

^113- 

(14.10) 



X[ 2 X n 

X\ 2 X l2 _ 




- 心 - 


Denote the coefficient matrix in (14.10) by K. 

Then we have 



_ N 

Nq. 

Ni. •• 

• N a —\” 

N.o 

Na 

... N ”b — 1 


No. 

No. 

0 •• 

0 


Noi 

… N 0,b-\ 



0 

Ni. •• 

0 

Nio 

Nn 

… N l,b-1 

K = 

N a -l,. 

0 

0 •• 

• N a —\ ” 

N a - 

-1,0 

N a -[，i 

... N a —\Jy—l 


N.o 


• • 

. N a-\,0 

N.o 

0 

... 0 


N.i 

Noi 

N U .. 

. N a-l,l 

0 


... 0 


_ N ”b-\ N 0,b-l 

N hb-1 .. 

• N a —]_,b— 

1 

0 

0 

… N .,b-l _ 


To obtain a solution to (14.10) we need to find a generalized inverse for K. If 
Nij = Ni.N.j/N for each i and j, then such a g inverse is given by 


K 


N 

1 

_ 7V 

1 

_ 7V 

1 

l 

_ 7V 

1 

_ 7V 

1 

_ 7V 


0 0 


1 

A^a 


l 




Nc 


Ah 


N 


b-l 


(14.11) 
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This follows easily by verifying that KK—K = K using the proportionality con¬ 
dition. The reader may recognize that the solution and hence K— can be obtained 
by imposing the conditions = 0 = Nj Bj . Consider now the con- 

trasts ^CiAi and ^djBj (with ^Ci = 0, ^ dj = 0) belonging to the main 
effects A and B, respectively. It follows then immediately that 

cov (J] Ci Ai, dj Bj ) = (0, c', 0)r (o) = 0 (14.12) 

where c' = (co, c\,, c a -\), d r = (do, di,, db-\), that is, estimates of con¬ 
trasts for the two main effects are uncorrelated. Hence we have an OMEP. □ 

Obviously the proof can be extended in a straightforward manner to any 
number of factors with the proportionality condition holding for any pair of 
factors. We mention also that Addelman and Kempthorne (1961) proved that 
proportional frequency is actually a necessary and sufficient condition for a main 
effect plan to be an OMEP (see also Dey, 1985). 

The methods of constructing main effect plans as described in the follow¬ 
ing section make use of the proportionality condition of Theorem 14.1. Before 
proceeding, however, we need to clarify an argument put forth by Srivastava 
and Ghosh (1996). They argue that main effect plans with proportional (and not 
equal) frequencies are not OMEPs. Clearly, Eq. (14.12) shows that the estimators 
of contrasts of main effects belonging to two different factors are uncorrelated. 
The concern of Srivastava and Ghosh (1996) has to do with estimators of contrasts 
belonging to the same factor, for example, 

(14.13) 

i i 

with Ci = Q = di and C[di = 0. Using K~ from (14.11), it follows 
immediately that the contrasts in (14.13) are correlated (see also Section 1.7.3). 
In order to produce uncorrelated contrasts Addelman (1962a) considers contrasts 
of the form 

J2 Ni - d i Ai (14-14) 

i i 

with E/ N L cf =0 = Zi Uf and E/ ^ c* = 0. 

We emphasize that contrasts of the form (14.13) or (14.14) are considered only 
for purposes of posthoc analyses (see Chapter 1.7). For qualitative factors such 
contrasts are prespecified, for quantitative factors they represent various forms 
of trends, such as linear, quadratic, and so forth. In either case the contrasts 
must be chosen such that they are meaningful in the context of the experiment, 
whether they are orthogonal or not. And that does not affect the properties of the 
design. Whether a main effect plan is an OMEP is determined only by whether 
the condition of Theorem 14.1 is satisfied. 
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14.3.3 Addelman - Kempthorne Methods 

We now describe three methods for constructing OMEPs for asymmetrical fac¬ 
torials as developed by Addelman and Kempthorne (1961). 

Method 1 

We consider the 1 x s; 2 x ... x factorial and assume that s\ > S 2 
> ••• > V 
Collapsing Factors 

We can construct an OMEP with N = runs, where 夕 i is a prime or prime 
power and 

q ^ _ i 

^~ - (14.15) 

台 A - 1 

in the following way. We first construct an OMEP for the s 1 ^ factorial in sf trials 
where i is chosen so that (14.15) is satisfied and where n = {s [ — l)/(^i — 1) 
using the method described in Sections 14.2 and 11.7. We write the plan as an 
N x n array, D say, such that the columns denote the factors and each row 
represents a treatment combination (run) for the n factors with s\ levels each. 
We shall refer to D also as the auxiliary OMEP. In the context of the x 
s^ 2 x ... x Sq' factorial the first n\ columns in D correspond to the first set of 
factors, each having levels. For the following factors we use what Addelman 
and Kempthorne (1961) refer to as collapsing factors with s\ levels to factors 
occurring at Si levels (/ = 2, 3, .. q) by setting up a many-one correspondence 
between the set of si levels and the set of Si levels. This means that different levels 
of the si-level factor are mapped into the same level of the -level factor. As 
an example, suppose we have = 5, S 2 = 3. We then might have the following 
mapping: 

Five-Level Factor Three-Level Factor 

0 — 0 

1 — ^ 1 

2 — ^ 2 

3 — 0 

4 — ^ 1 

Proceeding in this way we replace the next n ，2 factors by 52 -level factors, the 
following n 3 factors by ^ 3 -level factors, and so on, until all factors have been 
accommodated. Since Y^=\ n i ^ ^ it may be necessary to delete some factors 
from the basic OMEP if the inequality holds. Since the collapsing procedure 
leads automatically to proportional frequencies (for any collapsing scheme) the 
final design is an OMEP. 

We illustrate this procedure with the following example. 
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Example 14.2 Consider the 5 3 x 3 x 2 2 factorial. Since for i = 2, (5 2 — 
1)/(5 — 1) = 6 we can construct an OMEP with 25 runs from an OMEP for the 
5 6 factorial in 25 runs, using the method of Section 11.7.1. The construction of 
this plan is illustrated in Table 14.4a. Using the collapsing schemes 



we then obtain the final design given in Table 14.4b. It can easily be verified 
that the proportionality conditions are met. For example, for factors F 4 and 
F 5 we have Nqq = 6 ， Nq. = 10, A^.o = 15, and with N = 25 we have Nqq = 
10.15/25 = 6 . □ 

Expanding Factors 

Rather than collapsing the -level factor to an 57 -level factor we may, in certain 
situations, expand the ^ 1 -level factor into a set of ^-level factors if s\ = for 
some i. We know that there exists an OMEP for v = (s ^ 1 — l)/(si — 1) factors 
at Si levels with s ^ 1 runs, that is, an We then construct the for 

a suitable n as discussed before and i determined from n ^ (^1 — l)/(^i — !)• 
Next, we replace each of the levels for one factor by a treatment combination 
from the We thus replace an 51 -level factor by v = (s ^ 1 — 1 )/(々 一 1) 

Si -level factors. This method will be most useful for s\ = 4 = 2 2 and perhaps 
for 5 - 1=8 = 2 3 , s\ = 9 = 3 2 . We now illustrate this method in the follow¬ 
ing example. 

Example 14.3 Consider the 4 3 x 3 x 2 3 factorial. Instead of using the col¬ 
lapsing factors method with n\ = 3, ri 2 = 1, = 3 which would lead to an 

OMEP with 64 runs, we use the fact that a 4-level factor can be replaced by 
v = 3 two-level factors. We thus consider an OMEP for the 4 5 factorial that 
can be obtained with 16 runs and then use replacing and collapsing as given in 
Table 14.5. For details of the construction of the auxiliary OMEP using GF(2 2 ) 
arithmetic, we refer to Sections 11.7 and 11.13.1 (Example 11.5). In the final plan 
(Table 14.5b), the elements in GF(2 2 ) are replaced by 0, 1, 2, and 3, respectively. 
We use the following collapsing scheme for F 4： 


Four-Level Factor (F 4 ) 


Three-Level Factor (F 4 ) 
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Table 14.4 Orthogonal Main Effect Plan for 5 3 x 3 x 2 2 Factorial 




Fi 

f 2 

A 

Fa 

F5 

F 6 


\ 

(10) 

(0 1) 

(11) 

(12) 

(13) 

(14) 




a 

Auxiliary Plan (5 6 

in 25 Runs) 



0 

0 

0 

0 

0 

0 

0 

0 

0 

1 

0 

1 

1 

2 

3 

4 

0 

2 

0 

2 

2 

4 

1 

3 

0 

3 

0 

3 

3 

1 

4 

2 

0 

4 

0 

4 

4 

3 

2 

1 

1 

0 

1 

0 

1 

1 

1 

1 

1 

1 

1 

1 

2 

3 

4 

0 

1 

2 

1 

2 

3 

0 

2 

4 

1 

3 

1 

3 

4 

2 

0 

3 

1 

4 

1 

4 

0 

4 

3 

2 

2 

0 

2 

0 

2 

2 

2 

2 

2 

1 

2 

1 

3 

4 

0 

1 

2 

2 

2 

2 

4 

1 

3 

0 

2 

3 

2 

3 

0 

3 

1 

4 

2 

4 

2 

4 

1 

0 

4 

3 

3 

0 

3 

0 

3 

3 

3 

3 

3 

1 

3 

1 

4 

0 

1 

2 

3 

2 

3 

2 

0 

2 

4 

1 

3 

3 

3 

3 

1 

4 

2 

0 

3 

4 

3 

4 

2 

1 

0 

4 

4 

0 

4 

0 

4 

4 

4 

4 

4 

1 

4 

1 

0 

1 

2 

3 

4 

2 

4 

2 

1 

3 

0 

2 

4 

3 

4 

3 

2 

0 

3 

1 

4 

4 

4 

4 

3 

2 

1 

0 





b. Final Plan 





Fi 

Fl 

F3 

Fa 

f 5 

F 6 



0 

0 

0 

mm 

0 

0 



0 

1 

1 

2 

1 

0 



0 

2 

2 

1 

1 

1 



0 

3 

3 

1 

0 

0 



0 

4 

4 


0 

1 



1 

HI 

1 

1 

1 

1 



1 

1 

2 


0 

0 



1 

2 

3 


0 

0 



1 

3 

4 

2 

0 

1 



1 

4 


l 

1 




2 

■ 

2 

2 

mm 

■ 
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Table 14.4 (Continued) 
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•偏 
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Table 14.5 Orthogonal Main Effect Plan for 4 3 x 3 x 2 3 Factorial 


\ 

X 

a 

\ 

Fi 

Ol ， MO) 

Fi 

Oo, 《 i) 

F3 

Oi ， Ml) 

F 4 

Oi ， W2) 

f 5 

Ol ， W3) 




a. Auxiliary OMEP 



MO, 

UQ 

uo 

uo 

uo 

uo 

uo 

Mo, 

U\ 

uo 

U\ 

U\ 

U2 

M3 

MO, 

U2 

uo 

U2 

U2 

W3 

U\ 

MO, 

W3 

uo 

W3 

M3 

U\ 

U2 

Ml, 

wo 

U\ 

uo 

U\ 

U\ 

U\ 

Ml, 

U\ 

U\ 

U\ 

uo 

M3 

U2 

Ml ， 

U2 

U\ 

U2 

W3 

U2 

uo 

Ml, 

W3 

U\ 

M3 

U2 

uo 

M3 

U2, 

uo 

U2 

uo 

U2 

U2 

U2 

U2, 

U\ 

U2 

U\ 

W3 

uo 

U\ 

U2, 

U2 

U2 

U2 

uo 

U\ 

M3 

以 2, 

M3 

U2 

M3 

U\ 

W3 

uo 

W3, 

uo 

M3 

uo 

M3 

M3 

M3 

W3, 

U\ 

W3 

U\ 

U2 

U\ 

uo 

W3, 

U2 

M3 

U2 

U\ 

uo 

U2 

W3, 

W3 

W3 

W3 

uo 

U2 

U\ 
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Table 14.5 (Continued) 


Fi 

f 2 

f 3 

b. Final Plan 

Fa 

F 5 ，l 

^ 5,2 

F 5,3 

mmgM 

mm 

0 

0 

0 


■ 

cl 

i 
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2 

1 

1 

HI 


2 

2 

0 

0 

1 

1 


3 

3 

1 

1 


1 

l 

mm 

1 

1 

0 

1 

1 

l 

l 

mm 
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E3 

1 

l 

2 

3 

2 

mm 

■■ 

o 

l 

3 

2 


l 

1 

■1 

2 

mm 

2 

2 

l 


l 

2 

l 

3 


mm 

1 

l 

2 

2 

ma 

l 

l 

1 


2 

3 

l 


mm 


El 

3 

mm 

3 


l 

1 


3 

l 

2 

1 

mm 


HI 

3 

2 

1 


l 

■■ 

1 

3 

3 

■ 

2 

■ 

l 

1 


and the following replacement (expansion): 

F 5 Fs，i Fs ，2 F5,3 

0—000 

1 — ^01 1 

2 — ^ 1 0 1 

3—110 □ 

Method 2 

For the 5 1 x s 1 ^ 1 x s; 2 x ... x factorial we can construct an OMEP with 2s[ 
runs, where is a prime or prime power and 

^1 > ^2 > * * * > ^1 < s < 2s\ 

and 

q 〆 _ 1 

1 +< - ~r (14.16) 

A — 1 


We construct first the OMEP for the factorial where v = (< — l)/(^i — 1) 
and i is chosen such that (14.16) is satisfied. We then duplicate the plan except 
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that the levels 0 ， 1 , 2 ,... ， 以 一1 for the first factor, F\, are replaced by 
s\ + 1, s\ + 2, …， 2s\ — l, respectively. This will be an OMEP with 2s[ runs 
for the 2s\ x factorial. If .y < 2s\, we obtain the final plan by collapsing F\ 
and then proceed as in method la. We illustrate this with the following example. 

Example 14.4 We consider the 5 x 3 2 x 2 factorial. We have 1 + 2 + 1 = 
4 < (3^ — 1)/(3 — 1). Hence 1 = 2, and we construct the 3^J 2 , duplicate it (see 
Table 14.6a) and collapse factors F\ and F\ as follows: 


Six-Level Factor (Fi) 


Five-Level Factor (Fi) 


0 

1 

2 

3 

4 

5 



0 

1 

2 

3 

4 
0 


Three-Level Factor (F 4 ) 


Two-Level Factor (F 4 ) 


0 

1 

2 


0 

1 

0 


The final design is given in Table 14.6b. □ 

Method 3 

In some cases the methods described above lead to an unnecessarily large number 
of runs since the auxiliary plan is based on an OMEP involving factors with the 
largest number of levels. The method to be described is, by contrast, based on an 
appropriate OMEP for factors having the smallest number of levels. The basic 
idea is, as a converse to method lb, to replace several factors with the smallest 
number of levels by a factor with a larger number of levels. More precisely, 
we can construct an OMEP for the s^ 1 x s; 2 x ... x s? factorial with sf runs, 
where is a prime or prime power, < S 2 , … < s q and 

q s i _ i 

TKni < ^~ - (14.17) 

si - 1 

1 = 1 

with 1 = 入 1 〈入 2 i 入 3 S … ；£ 入 ^ and 入 / (/ = 1, 2, . . q) an integer. The basic 
idea of this procedure is based on the following theorem. 

Theorem 14.2 In the with n = (s^ — l)/(.y — 1) and 5 a prime or 

prime power, a factor with t levels, s m ~ l < t < s m , can be introduced as a 
replacement for a suitably chosen set of y = (s m — l)/(s — 1) factors that will 
preserve the orthogonality of the main effect estimates. 
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Table 14.6 Orthogonal Main Effect Plan for 5 x 3 2 x 2 Factorial 



a. Auxiliary Plan 



b. Final Plan 


Fi 

f 2 

F 3 

Fa 

Fi 

Fi 

F 3 

Fa 

■ 

■ 



HI 


■ 

圔 


1 

1 

2 

■ ■ 

1 

1 

■3 


2 

2 

1 

^■1 

2 

2 

1 

1 
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1 

1 

1 

2 
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1 

2 

■ 

1 

2 


2 

1 

2 

EH 
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2 

2 

2 


2 


2 
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1 

2 

1 

EH 

1 

2 

2 

l 
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2 
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3 
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3 

1 

1 

2 

3 

l 

1 
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2 

1 

3 

2 

2 

1 

4 

MM 
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4 

0 

1 

1 

4 

1 
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4 

1 

2 

■ 

4 
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2 

4 

2 

EH 

H 
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MM 

2 

2 

^■1 


2 

m 

5 

1 


1 


1 

EH 

1 

5 

2 

l 

EH 

Bl 

2 

1 

■ 

Proof 

Let 

t = s m • There 

exists an 

v-(v-m) 

^III 

with v = (s m - 

- V/(s 

-l). 

Hence a 

factor with t = s m 

levels can 

replace v 

factors with 5 

levels 

each. 

If s m ~ x < t < s m , then a factor with s m levels can be collapsed. 
We illustrate this method with the following example. 


□ 


Example 14.5 We consider the 2 4 x 4 factorial, that is, = 2, 52 = 4, = 

4 , ^2 = 1. Now t = 4 = 2 2 , that is, m = 2 and v = 3 and a 4-level factor can 
replace three 2-level factors in the £ ) where n and i are determined as 

follows so that (14.17) is satisfied: With n\ = 4, n 2 = 1, and y = 3 we have for 
(14.17) with 入 2 = v = 3 

4 + 3 • 1 = n < (2^ - 1)/(2 - 1) 

which yields 1 = 3. Hence we can obtain the OMEP with N = 2 3 = S runs from 
the 2 jjJ 4 . The final plan is given in Table 14.7 using the replacement according 
to the 2 jjY J given below: 

Two-Level Factors in Four-Level Factor 


00 0 — 0 

Oil — ^ 1 

10 1 — > 1 

110 — 3 
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Table 14.7 Orthogonal Main Effect Plan for 2 4 x 4 Factorial 


\ 

Fi 

(10 0) 

Fi 

(0 10) 

F 3 

(110) 

Fa 

(0 0 1) 

F 5 

(10 1) 

Fe 

(0 11) 

Fi 

(111) 




a. Auxiliary Plan 



0 0 0 

0 

0 

0 

0 

0 

0 

0 

0 0 1 

0 

0 

0 

1 

1 

1 

1 

0 10 

0 

1 

1 

0 

0 

1 

1 

Oil 
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1 

1 

1 

0 

0 

10 0 

1 

0 

1 

0 

1 

0 

1 
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1 

0 

1 

1 

0 
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0 

110 

1 

1 

0 

0 

1 

1 

0 

111 

1 

1 

0 

1 

0 

0 

1 


b. Final Plan 


F[ Fa ^5 ^6 Fi 


0 

0 

1 

1 

2 

2 

3 

3 


0 0 0 
1 1 1 
0 0 1 
1 1 0 
0 1 0 
1 0 1 
0 1 1 
1 0 0 


0 

1 

1 

0 

1 

0 

0 

1 


It follows then that in the auxiliary plan (Table 14.5a) factors F\, F 2 , F 3 with 
two levels are replaced by factor F[ with four levels. □ 

This procedure of constructing an OMEP is most useful if s < t < s 2 . It 
follows then from Theorem 14.2 that the maximum number of f-level factors 
that can be introduced into the with n = (s^ — l)/(s — 1) is equal to 

(〆 一 1)/(^ — 1) if € is even, and equal to the largest integer less than or equal 
to (〆 一 1)/C? — 1) — 1 if € is odd. 

The three methods described above lead to a very large number of OMEPs for 
asymmetrical factorials. An extensive catalog of such plans is provided by Addel- 
man and Kempthome (1961)，Addelman (1962a, 1962b), and Dey (1985). Dey 
also lists some additional OMEPs not derivable by the Addelman - Kempthorne 
methods but instead derivable from Hadamard matrices or orthogonal arrays. For 
details the reader is referred to Dey (1985). 

For the 2 n x 3 m Addelman - Kempthome designs Margolin (1968) has inves¬ 
tigated the aliasing of main effects with 2-factor interactions. This is important 
for the reasons discussed in Section 14.2 for OMEPs for symmetrical factorials. 
We shall not discuss the rather complicated results here but refer to the method 
illustrated in Table 14.2. 
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14.4 NONORTHOGONAL RESOLUTION III DESIGNS 

Orthogonal main effect plans are useful because they are easy to analyze and the 
results are easy to interpret. Sometimes there may, however, be practical reasons 
why an OMEP may not be the most suitable design. In that case nonorthogonal 
MEPs may be more appropriate. We shall give some examples. 

As we have mentioned earlier, a saturated OMEP does not provide any degrees 
of freedom for the estimation of the error variance. If no information about the 
error variance is available, additional observations must be obtained. To replicate 
the OMEP may lead to too large an experiment. Hence only some runs may 
be replicated leading to nonproportional frequencies and as a consequence to 
correlated estimates of the main effects. 

In contrast to the just described situation an OMEP may not be a saturated 
plan and may therefore have too many runs. It is always possible to obtain a 
saturated MEP. One such plan is the one-at-a-time plan, where if the runs are 
considered in sequence, only the level of one factor is changed from one run to 
the next. As an example consider the following MEP for a 2 8 in nine runs (apart 
from randomization): 


Run F\ F 2 F 3 F 4 7^5 Fj 


1 

2 

3 

4 

5 

6 

7 

8 
9 


0 0 
1 0 
1 1 
1 1 
1 1 
1 1 
1 1 
1 1 
1 1 


0 0 
0 0 
0 0 
1 0 
1 1 
1 1 
1 1 
1 1 
1 1 


0 0 
0 0 
0 0 
0 0 
0 0 
1 0 
1 1 
1 1 
1 1 


0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
1 0 
1 1 


An alternative to the plan above is one where each run, other than the control, 
has only one factor at the high level and all other factors at the low level (apart 
from randomization): 


Run F\ F 2 F 4 F 5 Fj 


1 

2 

3 

4 

5 

6 

7 

8 
9 


0 0 
1 0 
0 1 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 


0 0 
0 0 
0 0 
1 0 
0 1 
0 0 
0 0 
0 0 
0 0 


0 0 
0 0 
0 0 
0 0 
0 0 
1 0 
0 1 
0 0 
0 0 


0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
0 0 
1 0 
0 1 
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Obviously, this method can be adapted to both symmetrical and asymmetrical 
factorials with factors at any number of levels. It should be clear to the reader 
that if a saturated OMEP exists for a given factorial it will be more efficient than 
the comparable nonorthogonal MEP. But often statistical properties may have to 
be balanced against practical considerations. 

In listing the designs above we have added for emphasis the cautionary 
note “apart from randomization.” There may be settings, especially in indus¬ 
trial applications, where it would be most economical and feasible to execute the 
experiment using the run order given above. This may have serious consequences 
for the analysis and interpretation, since not only is there lack of randomization 
(see 1.9.9) but this would also invoke a split-plot type of experiment if the levels 
of factors are not reset for each run (see 1.13). For a discussion of such problems 
see Daniel (1973) and Webb (1968). 



CHAPTER 15 


Supersaturated Designs 


15.1 INTRODUCTION AND RATIONALE 

We have seen in the preceding chapters that very often practical considerations 
lead the experimenter to using fractional factorial designs. The aim, of course, is 
to reduce the size of the experiment without sacrificing important information. 
Even then, the suggested design may be quite large. This may be true even for the 
most highly fractionated design, the saturated main effect plan, if the number of 
factors under investigation is large. And it is not unusual that in many physical, 
chemical, or industrial experiments the number of factors that one would like to 
study is quite large, but consideration of cost calls for a small number of runs, 
that is, treatment combinations. Moreover, it is suspected that among the large 
number of factors only few are important or active. Such situations have led to 
the development of what are called supersaturated designs (for a definition see 
Section 15.3). 

The idea is to somehow select N treatment combinations from the totality of 
all possible treatment combinations, say from the a x 乃 x ... x & combina¬ 
tions of a factorial with factors A\, A 2 ,..., A n where factor A/ has pi levels 
(i = 1,2,... ,n) and n > N — l. Obviously, orthogonality as with fractional fac¬ 
torials can no longer be achieved. Rather, orthogonality is being replaced by some 
measure of near orthogonality. Also the analysis of data from such an experiment 
will have to be approached differently from the usual analysis of factorial exper¬ 
iments. Simultaneous estimation of effects will no longer be possible. Instead, 
based on the assumption of scarcity of active factors and absence of interactions, 
stepwise forward regression provides a plausible approach. 

15.2 RANDOM BALANCE DESIGNS 

One way to select the N treatment combinations is by using a random sampling 
process. This idea has led to the notion of random balance designs (Satterth- 
waite, 1959). 
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Here the notion of random balance is understood in the following sense 
(Raghavarao, 1986): Factor Aj is randomly balanced with respect to factor A/ 
if the random sampling process used to select the p } levels of factor Aj is the 
same for each level of factor A /. If every factor has random balance with respect 
to every other factor, then Satterthwaite (1959, p. 113) refers to this as a pure 
random balance design. 

Satterth waite (1959, p. 114) states that “pure random balance designs are 
almost always inefficient in some sense” (see Section 15.4.1), but at the same 
time he argues that “there are important and large classes of problems for which 
these inefficiencies are often unimportant or even trivial.” These views were 
largely contradicted by Youden et al. (1959), who, among other things, called for 
a systematic, rather than random, construction of designs for the earlier stated 
purpose. 


15.3 DEFINITION AND PROPERTIES OF 
SUPERSATURATED DESIGNS 


Generalizing some results of Yamada and Lin (1999)，we consider an experi¬ 
ment with n factors A\, A 2 ,..., A n , each with p levels denoted by 1, 2,..., p. 
Let N be the number of runs (treatment combinations) with (i) (p — \)n > 
N — \ and (ii) N = p . q and q a positive integer. Further, let C(A0 be the 
set of -dimensional vectors each containing each of the p levels exactly 
q times. 

A p-level supersaturated design D can be described as a selection of n vectors 
in Q(N), say ci, C 2 ,..., c n , by some specified rule. The design D can then be 
represented by the TV x « array 


[C]_ ， C2，.. • ， c"] (15.1) 

Obviously, the rule by which the selection leading to (15.1) is made is of major 
importance. As mentioned earlier, orthogonality of the vectors in (15.1) cannot 
be achieved, but we would like to stay as “close” as possible to this property, 
which is exhibited in the Fisher and Plackett-Burman main effect plans (see 
Sections 14.2.1 and 14.2.4). Following Yamada and Lin (1999) we shall adopt 
the following x 2 statistic as a measure of deviation from orthogonality and choose 
the design D, which achieves the smallest amount of deviation. Such a design 
we declare to exhibit near-optimality. 

Let n ab (ci, cj) denote the number of rows in the N x 2 array [c/, Cj] whose 
values are [a, b], where cj, (i < j) are two columns from (15.1). Obviously, 

p 

n a \ Ci ,Cj) = N 

a,b=\ 
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for any pair i, j(l < i < j < n). The x 2 statistic mentioned above is then 
defined as 


2 _ [n ab (Cj, Cj)- N/p 2 ] 2 


(15.2) 


For orthogonality (or complete independence) of C[ and Cj we have n ab (Ci, Cj)= 
N/p 1 for each pair [a, b] (1 < a, b < p), whereas for complete confounding of 
factors A/, Aj (or complete dependence) we have, without loss of generality, 

, [N/p for a = b 

[0 otherwise 

It is then easy to see that the x 2 value of (15.2) lies between 0 and (p — l)N. 

For the overall evaluation of deviation from orthogonality for the design D, 
we then use either 



E 

^.j 






av x 2 = 

i<j 


(15.3) 

n(n — 

1)/2 

max x 2 = 

max x 

n 1 <« < y < « 

(15.4) 


with Xij given in (15.2). Using either (15.3) or (15.4) the design D with the 
smallest value will be chosen among competing arrangements. 

For an extension of the x 2 criterion to mixed-level supersaturated designs see 
Yamada and Matsui (2002). 


15.4 CONSTRUCTION OF TWO-LEVEL 
SUPERSATURATED DESIGNS 

15.4.1 Computer Search Designs 

Most of the emphasis of devising rules for obtaining a suitable array (15.1) has 
been concentrated on 2-level supersaturated designs, that is, p = 2, denoting 
the levels by —1 and +1. Booth and Cox (1962) provided the first set of such 
designs for the following pairs of (n, N) values: (16, 12), (20, 12), (24, 12), 
(24, 18), (30, 18), (36, 18)，and (30, 24). These designs were obtained through 
computer search. 
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To compare their designs with random balance designs, Booth and Cox (1962) 
consider Sij = c\cj (i < j) as a measure of nonorthogonality of factors A/ and 
Aj. As an overall measure of the non-orthogonality for the design under consid¬ 
eration, they then define 


TA 


E(s 2 )= 


i<j 

n(n — 1)/2 


(15.5) 


Yamada and Lin (1999) show that (15.5) is equivalent to (15.3) for p = 2. For 
a random balance design (15.5) is given by 


Erbd(^ 2 )= 


N 2 

N - 1 


(15.6) 


(Booth and Cox, 1962). It turns out that for all designs constructed by Booth and 
Cox (1962) ^ ‘ 


E(s 2 ) < ErbdO 2 ) 

indicating a smaller degree of nonorthogonality, that is, less confounding among 
the factors, of their designs, a result that holds true in general for other supersat¬ 
urated designs as well. 


15.4.2 Hadamard-Type Designs 

A more systematic way to construct a special class of 2-level supersaturated 
designs was provided by Lin (1993). Just as the Plackett-Burman designs (see 
Section 14.2.4) are based on Hadamard matrices of order M, so are the proposed 
supersaturated designs. Their construction can be described in general as follows: 
Consider a Hadamard matrix of order M (which is not constructed via foldover) 
in seminormalized form. Delete the first column of +l’s and label the remaining 
columns 1, 2,..., M — 1. Choose one of these columns as the branching column, 
that is, the column that divides the Hadamard matrix into two half-fractions 
according to its +l’s and — l’s. Delete the branching column and choose a sign, 
say +. Then the M/2 rows that corresponding to +1 in the branching column 
form a supersaturated design with n = M — 2 factors and N = M/2 runs, where 
each factor is, of course, represented M/4 times at the high and low level. 

Letting ^ be an M x M Hadamard matrix, then the procedure described 
above can be formally represented as 


H 


0 H h 

-0 * 


(15.7) 
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(Bulutoglu and Cheng, 2003), where 0 represents a vector of N/2 unity elements 
and H h represents the supersaturated design, that is, the columns of H h represent 
the columns [c\, C 2 ,..., c n ] of (15.1) with n = M — 2. 

Example 15.1 We shall illustrate this method with the design for M = 12 
given by Lin (1993). Table 15.1 contains the Hadamard matrix in the form indi¬ 
cated by (15.7), where + represents +1 and — represents —1. 

It is easy to see that for this from (15.5) design all | Sjj | = 2 and hence 
E(s 2 ) = 4, compared to £"rbd(s 2 ) = 7.2 from (15.6). □ 

For n < M — 2 we can delete certain columns from the complete design. In 
practice it does not matter which column we choose as the branching column 
and which columns we delete subsequently (Lin, 1993). 

Another Hadamard-type supersaturated design was proposed by Wu (1993). 
Starting with a seminormalized Hadamard matrix of order M, it makes use of all 
its columns except the first and augments it by interaction columns to generate a 
supersaturated design with N = M runs for n factors, where M < n < 2M — 3. 
The procedure can be described as follows. 

Let the generator Hadamard matrix H be given by H = (3, x\, X 2 ,..., 
Xm-i)- Obtain M — 2 interaction columns Xi 0 j = Xi 0 o xj as the elementwise 
product of Xi 0 and xj (j ^ io, io fixed). For cyclic Hadamard matrices we 
can choose io = 1 ； for noncyclic Hadamard matrices we choose io to minimize 
max I jc• Xi Q j I over i and j. Thus, (15.1) is given by 

[xi ， X2, •.. ， XM-\,Xi 0 j(j = 1 ， 2, ...，M — 1; ) # io)] 


Table 15.1 Supersaturated Design for w = 10 Factors in N = 6 Runs 


Factor 


Row 1 2 3 4 5 6 7 8 9 10 Run 
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Among all inner products of the columns only jc; Xi Q j with /, io, j all different 
may be nonzero. Hence the choice of io ensures that the confounding among 
factors is kept to a minimum. It can be shown (Wu, 1993) that for the above 
designs 


E(s 2 ) 


M 2 (n -M + 1) 
n(n — 1)/2 


(15.8) 


Ifn< 2M — 3, we simply choose n — M + 1 columns from the M — 2 interac¬ 
tion columns, keeping E(s 2 ) as small as possible. 


A third method to construct Hadamard-type supersaturated designs was pro¬ 
posed by Tang and Wu (1997). The method essentially consists of adjoining k 
Hadamard matrices (after deleting their first column of ones) of order M. The 
resulting array yields a supersaturated design for fz = k(M — 1) factors in M runs. 
For n = (k — 1)(M — l) j factors (with 1 < y < M — 2) we delete M — \ — j 
from the full design, that is, the design for k(M — 1) factors. 

We shall conclude this section by making some remarks about the possible 
optimality with regard to the E(s 2 ) criterion. Nguyen (1996) and Tang and Wu 
(1997) have shown that a lower bound for E(s 2 ) is given by 




(15.9) 


Nguyen (1996) has shown that the half-fraction Hadamard designs of Lin (1993) 
achieve the lower bound (15.9). This is true also for the full designs given by 
Tang and Wu (1997). Comparing (15.8) with (15.9) shows that for n = 2M — 3 
the designs of Wu (1993) have E(s 2 ) values only slightly larger than the lower 
bound. A similar statement can be made about the designs of Tang and Wu (1997) 
when n = (k — l)(M — 1) + j. Obviously, designs that achieve the lower bound 
are £ , (5 ,2 )-optimal. 


15.4.3 BIBD-Based Supersaturated Designs 

Interesting relationships between certain BIB designs and supersaturated designs 
were established by Nguyen (1996), Cheng (1997), and Liu and Zhang (2000). 
We shall not go into all the details here, but just describe one method based 
on the cyclic development of initial blocks as described by Liu and Zhang 
(2000). They establish the equivalence of existence of a supersaturated design 
with N = 2v, n = c(2v — 1) (with v > 3, c > 2 and when v is odd, c is even) and 
that of a BIB design with t = 2v — 1, b = c(2v — 1), r = c(v — l), k = v — l, 
入 = c(v — 2)/2, which is cyclically developed from c initial blocks. 
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Given a BIB design with parameters specified above and the treatments labeled 
as 0, 1, 2,..., r — 1 = 2 d — 2, the procedure to generate a supersaturated design 
can be described as follows: 

1. Develop the first initial block cyclically, giving rise to blocks 1,2,..., 

2 v- 1 . ^ i 

2. Consider an array of 2i> — 1 columns, each column containing the elements 
0, 1, 2, ... ， 2i» — 1 (in that order), which correspond to the 2v — l blocks 
obtained in step 1. In column j replace the v — \ elements corresponding 
to the treatments in block j by +1 and the remaining v elements by — 1. 

3. Add a +1 in the first position to each column, thus generating the first 
2v — l columns of the array (15.1). 

4. Repeat steps 1, 2, and 3 for each of the remaining c — 1 initial blocks, thus 
generating altogether the n = c(2v — 1) columns of (15.1). 

We shall illustrate this procedure in the next example. 

Example 15.2 For u = 4, c = 2 the two initial blocks are given by Liu and 
Zhang (2000) as (0 1 3) and (0 1 5). The individual steps then are: 

(1) Develop (0 13) and (0 1 5) mod 7: 






0 

1 

2 

3 4 

5 

6 









1 

2 

3 

4 5 

6 

0 









3 

4 

5 

6 0 

1 

2 









0 

1 

2 

3 4 

5 

6 









1 

2 

3 

4 5 

6 

0 









5 

6 

0 

1 2 

3 

4 





⑵ + (3) + ⑷ and 

writing 

+ for +1 and 

1 - 

for 

-1: 




+ 

+ 

+ 

+ 

+ 

+ 

+ 

+ 

+ 

+ 

+ 

+ 

+ 

+ 

+ 

— 

— 

— 

+ 

— 

+ 

+ 

— 

+ 

— 

— 

— 

+ 

+ 

+ 

— 

— 

— 

+ 

— 

+ 

+ 

— 

+ 

— 

— 

— 

— 

+ 

+ 

— 

— 

— 

+ 

— 

+ 

+ 

— 

+ 

— 

— 

+ 

— 

+ 

+ 

— 

— 

— 

— 

— 

+ 

+ 

— 

+ 

— 

— 

+ 

— 

+ 

+ 

— 

— 

— 

— 

— 

+ 

+ 

— 

+ 

— 

— 

+ 

— 

+ 

+ 

— 

+ 

— 

— 

— 

+ 

+ 

— 

— 

— 

— 

+ 

— 

+ 

+ 

— 

+ 

— 

— 

— 

+ 

+ 


The columns represent the supersaturated design (15.1) with N = S, n = 14. 
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Liu and Zhang (2000) provide a list of initial blocks for constructing a large 
number of supersaturated designs with N = 2v and n = c(2v — 1) for 3 < i; < 
12. All these designs are ^Cs 2 ) optimal. □ 

15.5 THREE-LEVEL SUPERSATURATED DESIGNS 

Clearly, from a practical point of view two-level supersaturated designs are the 
most important. But in some situations three-level designs may be called for if 
the few active factors are expected to exhibit linear and/or quadratic effects. 

Whereas there exist several methods of constructing two-level supersatu¬ 
rated designs, there exist only very few for constructing three-level designs (see 
Yamada and Lin, 1999; Yamada et al., 1999; Fang, Lin, and Ma, 2000). We shall 
describe briefly one method due to Yamada and Lin (1999). 

Let C = [c\, C 2 ,..., c n ^\ be a two-level supersaturated design or saturated 
main effect plan in N* runs. Let c/) ab (C) be an operator that changes the elements 
in C from —1 (or —) to a and +1 (or +) to b. Then consider the matrix 

> 12 (C) 0 12 (C) 0 13 (C) 0 23 (C)- 

D = 少 23 (C) 0 13 (C) 0 23 (C) 0 12 (C) (15.10) 

W) 0 23 (C) 0 12 (C) <t> n {C)_ 

D 2 D 3 D 4 ] (15.11) 

Array (15.10) represents a three-level supersaturated design with N = 3N* runs 
and n = 4n* factors. Each level of each factor occurs N* times, and In > N. 

For n = 2n* or n = 3n* a combination of subdesigns D/ (/ = 1, 2, 3, 4) from 
(15.11) can be used to form a suitable supersaturated design. We shall illustrate 
this with the following example. 

Example 15.3 Suppose we want to examine 14 three-level factors accounting 
for 28 effects, and suppose further that we are allowed 24 runs. A suitable design 
can be constructed by choosing for C the saturated main effect plan with = 8 
and n* = 7, given by the following Hadamard matrix 
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Using the max x 2 criterion (15.4), Yamada and Lin (1999) show that if 
max I Sij \ = d, say, over all /, j (1 < / < j < n*), then for the design D in 
(15.10) or a combination of subdesigns from (15.11) we have 

2 \ (N + 9d) 2 N 

max y = max { - , —— 

A ( SN 2 J 

This result is useful because it allows us to compare competing designs with 
regard to the maximum dependency between two factors, the smaller this value, 
the better the design. 


15.6 ANALYSIS OF SUPERSATURATED EXPERIMENTS 

Because the number of runs, N, is smaller than the number of parameters to 
be estimated, (p — l)n, we cannot apply the usual analysis to the data from a 
supersaturated experiment, that is, an experiment using a supersaturated design. 


Using I >2 and D 3 from (15.11) and applying the transformations to (15.12) 
yields the supersaturated design D: 


□ 


113133312232333211212221 

131333112323332212122211 

313331113233322221222111 

133311312333223212221121 

333113113332232222211211 

331131313322323222112121 

311313313223233221121221 

112122211131333122323332 

121222111313331123233322 

212221113133311132333222 

122211211333113123332232 

222112113331131133322322 

221121213311313133223232 

211212213113133132232332 
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Therefore, the analysis of such data must be predicated on the principle (or 
assumption) of active factor sparsity (Box and Meyer, 1986a, 1986b) or, in the 
case of p-level designs, effect sparsity as each factor is associated with p — 1 
effects. Furthermore, we assume that all interactions are negligible. 

To put this in the context of a linear regression model, we express the obser¬ 
vation vector y f = (yi ，： y2, … ， yN) as 

y = Xp-^e (15.13) 

where X is an N x [(p — l)n + 1] known matrix, which is determined by the 
design matrix, P is the [(p — l)n + 1] vector of unknown parameters, and e is the 
error vector with the variance - covariance structure for a completely randomized 
design (see 1.6.3.4). Based on the assumption of effect sparsity we may rewrite 
(15.13) as 

y = (Xu X 2 )^ + e (15.14) 

where P j represents the vector of nonzero or large effects (including the intercept) 
and represents the vector of zero or negligible effects. Our aim, obviously, is 
to identify ) 81 , that is, the “true” model. 

This necessitates applying variable selection procedures. An obvious choice is 
stepwise variable selection. This method may, however, not always be the best 
choice. If the number of factors is very large, the probability is high that some 
of the inert factors may be selected (type I error). Also, mainly because of the 
nonorthogonality of the X matrix, it may not always select active factors (type II 
error). A thorough discussion of these difficulties has been provided by Westfall, 
Young, and Lin (1998). To overcome in particular type I error problems, they 
recommend using adjusted P values at each step of the stepwise procedure, using 
family wise error rates. The family wise error rate should not be higher than .50 
for inclusion of a variable in the model. 

It is for these reasons that Abraham, Chipman, and Vijayan (1999) recommend 
the use of all-subsets regression. For obvious reasons, this method has to be 
implemented in such a way that only subsets up to a certain prespecified size 
(< N) will be considered. An advantage of this method is that it may produce 
different competing models, which allow the experimenter to make a choice 
based perhaps on subject matter considerations. 

Using similar arguments against the use of stepwise variable selection, Li and 
Lin (2002) propose a penalized least-squares approach to the variable selection 
problem. Let us rewrite (15.13) as 

j,- e t (i = 1 ， 2 ， ... ， N) (15.15) 

where X/ is the vector of input variables. For example, for the two-level design 
(15.15) is i 


只 =A) + ^2i^2 + • . . + ^ni^n 
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or for the three-level design we have 

yi = A) + x \iP\ + -^ 1/^11 + x 2i^2 + x 2i^22 + * * * + x niPn + X ni^nn + e i 

A form of penalized least squares is defined as 

1 N (p-l)« 

E ^d^i) (i5.i6) 

i=l j=0 


(Fan and Li, 2001; Li and Lin, 2002), where px N (') is a penalty function and 入 # 
is a tuning parameter, which is usually chosen by some data-driven approach. 

Since this method cannot be applied to the full model (15.15), Li and Lin 
(2002) recommend to proceed in two steps: Apply stepwise variable selection 
(or subset selection, we might add) in step 1, using a low threshold, that is, a 
high a value, and then, in step 2, apply (15.16) to the model obtained in step 1. 
More formally, in (15.16) P = and x/ = Xu, say, where Xu conforms to 
An appropriate penalty function for supersaturated experiments is the smoothly 
clipped absolute deviation (SCAD) penalty, whose first-order derivative is 

<(|/i|)= 入 {/(l^l <A) + > 入） (15.17) 

for |3| > 0 with a = 3.7, and p 入 (0) = 0 (Fan and Li, 2001; Li and Lin, 2002). 

Suppose Pi in (15.14) has been identified in step 1 above to consist of d 
terms, say ^ ⑴， 0 ( 2 ),..., 0 ⑷. Let be an initial value close to the true value 
Pi ，when ^( y ) is not very close to zero. The penalty function (15.17) can be 
locally approximated by a quadratic function as 


: p 入 (IA))I)]’~ 



(15.18) 


and = 0 if is close to zero. With the local approximation (15.18), the 
solution to the penalized least square can be obtained iteratively as 


with 


^8^ = jz ； Z 1+ ^ V ^ (<)) 厂 Vj 



(15.19) 


For more details about the ridge regression (15.19) and description of the prop¬ 
erties of the penalized least-squares estimators we refer the reader to Li and 
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Lin (2002). See also Li and Lin (2003) and Holcomb, Montgomery, and Carlyle 
(2003) for a comparison of analysis procedures. 

Regardless of which type of analysis is used, special care has to be exercised 
to interpret the results. We always have to remember that supersaturated designs 
are used for screening purposes, and the primary objective is to eliminate a large 
portion of inactive factors from further consideration. The nature of the designs 
do not guarantee that we always find the right answer. In any case, smaller 
follow-up experiments will be needed. 



CHAPTER 16 


Search Designs 


16.1 INTRODUCTION AND RATIONALE 

The usefulness of factorial experiments is based on the premise that high-order 
interactions are negligible. In previous chapters we have shown how this premise 
can be used to construct useful and efficient factorial designs through confounding 
and fractionation. Extreme examples of the latter category are, of course, main 
effect plans and supersaturated designs. They are based on the severe and often 
nonsustainable assumption that all interactions are negligible. 

It is this point that leads to the question whether one can construct designs 
that (1) have a relatively small number of runs and (2) allow the exploration 
of a limited number of low-level interactions. The emphasis here is on “lim¬ 
ited number,” because even for a 2 n factorial there are n(n — 1)/2 two-factor 
and n(n — l)(n — 2)/6 three-factor interactions and to investigate them all would 
necessitate a resolution VII design. In most cases this may lead to a prohibitively 
large number of runs. But how can we limit the number of interactions to be inves¬ 
tigated since we do not know which of the interactions are possibly nonnegligible? 

Clearly, we need to make some assumptions to overcome this difficulty. Rather 
than specifying the particular interactions we would like to investigate, we specify 
the number of interactions to be investigated. In other words, we assume that a 
specified number, say 灸 ， 2- or 3-factor interactions are possibly nonnegligible. We 
would then like to construct a design that allows us to (1) identify these nonnegli¬ 
gible interactions and (2) estimate these effects in addition to the main effects. This 
idea was first proposed and investigated by Srivastava (1975) when he introduced 
the concepts of search design and search linear model. We shall describe in this 
chapter some of his results and subsequent developments on this topic. 

16.2 DEFINITION OF SEARCH DESIGN 

Let us consider here the case of n factors, each at two levels. We know from 
Section 7.4.2 that the true response for each treatment combination can be 
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expressed as a linear function of the mean, all main effects and all interactions. 
We can divide the 2 n — l factorial effects into three categories: 

Cl ： contains the effects we want to estimate in any case, such as the main 
effects; 

62 - contains effects we are interested in, but we know that only a few of 
them are nonnegligible, such as 2 -factor interactions, 62 ( 2 ), or 2 - and 3-factor 
interactions, 62 (^, 3 )； 

63 ： contains the remaining effects (such as higher-order interactions), which 
we consider to be negligible. 

Based on the above scenario we can then express a set of N observations, 
given by the x 1 vector y, in terms of the following linear model: 

y = + A 2^2 + e (16.1) 

where A\, A 2 are known matrices of order N x v\ and N x V 2 , respectively, 
^ is a vi x 1 vector of unknown parameters consisting of the mean /x and the 
effects in Ci, 夸 2 is a v »2 x 1 vector of unknown parameters pertaining to the 
effects in 62 , and ^ is the x 1 vector of errors with 

E(e) = 0 var(^) = V (16.2) 

where V represents the variance - covariance matrix for e under the error structure 
of the completely randomized design (see Section 1.6.3.5). 

We assume now that at most k of the effects in C 2 are nonnegligible, where k is 
quite small compared to V 2 . Let D be the design that gave rise to the observations 
y in (16.1). If y and hence A\ and A 2 are such that we can estimate § 1 and the k 
nonnegligible effects of § 2 , then D is said to be a search design of resolving (or 
revealing) power (^; $ 2 , k) and the model (16.1) is said to be a search linear 
model (Srivastava, 1975). 

16.3 PROPERTIES OF SEARCH DESIGNS 
16.3.1 General Case 

In the words of Srivastava (1984), model (16.1) represents a supermodel from 
which we want to extract an individual model based on the above criteria. Obvi¬ 
ously, a search design that accomplishes this task must have certain properties. 
More specifically, the search design must have a sufficiently large number of 
runs and it must include a particular subset of all possible runs. 

Consider a design D with N runs and suppose that D is composed of two parts 
or subdesigns, D\ with N\ runs, and D 2 with N 2 runs. Let the observation vector 
y in (16.1) be partitioned conformably as y f = (〆” y f 2 ), where is N\ x l and 
is associated with D\, while y 2 is A ^2 x 1 and is associated with D 2 . Very often 
D\ represents a main effect plan to which treatment combinations in the form 
of D 2 are added to generate a search design. In essence, this is a process of 
dealiasing main effects from certain interactions. 
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A fundamental result that provides a characterization of the properties that 
such a design must possess is due to Srivastava (1975) and can be stated as 
follows: 

Theorem 16.1 (i) Given model (16.1)，a necessary condition for a design D 

to be a search design of revealing power ( 夸 ^ § 2 , /:) is that 

rank(Ai, A 20 ) = v\-\-2k (16.3) 

for every N x 2k submatrix A 20 of A 2 . 

(ii) In the noiseless case, that is, for ^ = 0 in (16.1) and (16.2), the condition 
(16.3) is also sufficient. 

We shall comment briefly on the noiseless case. It implies, of course, that there 
is no experimental error and no observational error (see Section 1.6.3.4). Such a 
situation does not occur in practice. Even so, the noiseless case is important with 
regard to the construction of search designs, for if the structure of designs D\ and 
D 2 and hence the structure of A\ and A 2 in (16.1) is such that the search and esti¬ 
mation problem cannot be solved in this case, then it certainly cannot be solved 
when noise is present. Moreover, in the noiseless case the observations y in (16.1) 
are nonstochastic and hence the search problem is deterministic. This implies that 
the nonnegligible effects in can be identified with probability one if D satisfies 
the condition (16.3) of Theorem 16.1 In the stochastic case such identification 
can be achieved only with a certain probability, usually less than one. 

The following theorems provide not only a tool to check whether for a given 
design D condition (16.3) is satisfied but can also be used to obtain more explicit 
conditions that a search design has to satisfy. 

Theorem 16.2 (Srivastava and Ghosh, 1977) Consider model (16.1) and 
suppose that rank(Ai) = v\. Then condition (16.3) holds if and only if for every 
N x 2k submatrix A 20 of A 2 we have 

rank[A2 0 A2o — A f 2 QA\(A\A\)~ l A\A2 o\ = 2k (16.4) 

A similar result is given in the following theorem. 

Theorem 16.3 (Srivastava and Gupta, 1979) Consider model (16.1) and 
suppose that rank(Ai) = v\. Let A\\ be a v\ x v\ submatrix of A\ of rank 
v\. Rearrange the observation vector y and write (16.1) as 

(;;HX) … （ i6 . 5 ) 


where j*, An, and A 12 have v\ rows and y|, A 21 , and A 22 have N — rows. 
A necessary and sufficient condition for (16.3) to be satisfied is that every set of 
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2k columns of 

Q = A 22 -A 2l AY l l A l2 (16.6) 


are linearly independent. 


Note that in Theorem 16.3 the vector j* is not necessarily equal to y j obtained 
from D\ and, hence, is not necessarily equal to obtained from D 2 as 
described earlier. However, if D\ is a saturated main effect plan with Ni = vi = 
"+ 1, then j* = and, hence, = j 2 . 

16.3.2 Main Effect Plus One Plans 

As stated earlier, in the context of search designs we assume a priory that at 
most k effects in of (16.1) are nonnegligible, where k is generally small. The 
most important and tractable situation from both the theoretical and practical 
point of view is the case 众 =1. In addition, the most obvious starting point is to 
consider the case where ^ j is the vector of the mean, /x, and the n main effects, 
say F\, F 2 ,..., F n , of the n factors F\, F: ， … ， F n , each at two levels. Thus, D\ 
might be a main effects plan. The resulting search design D is then referred to 
as a main effect plus one plan, or MEP.l for short, with resolution III.l. 

Let us denote a treatment combination by x f = (jci, ^ 2 ,..., x n ) with xi = 0 
or 1(1 = 1,2,..., n). Then we denote the Ni treatment combinations of Di(i = 
1, 2) by x 、 = (xij\,Xij 2 , . • •, Xij n ) for y = 1, 2,..., Ni. Now consider the fol¬ 
lowing D\ with N\ = n -\-2 treatment combinations and given by its design 
matrix 


/t\ 

Dl = In (16.7) 

Vo：,/ 

Recall from Section 7.4.2 that the true response of treatment combination x can 
be expressed as 


ci{x) = /^ + — E< 

OL 

or, if we define E* a = as 

a(x) = /x + (16.8) 


Taking j* to be the first n-\- l observations in D\ it follows from (16.7) and 
(16.8) that An in Theorem 16.3 can be written as the patterned matrix 


An = 



3; 

21 n - Dnt 
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with A~[i having the same pattern. Following lengthy but elementary arguments 
we can then derive the general form of Q in (16.6) such that the conditions 
of Theorem 16.3 are satisfied (see Srivastava and Gupta, 1979), leading to the 
following theorem. 

Theorem 16.4 (Srivastava and Gupta, 1979) Let D = (^) be the design 
matrix for the design D, where D\ is given by (16.7). Let D21 and D22 be any 
two submatrices of D 2 of order (A^ — vi) x and (A^ — v\) x n 2 , respectively, 
with n\, n，2 > 2. Let riij be the number of ones in the ith row of (j = 1 , 2) 
and let for an integer z 

z if z is even 

Hz)=. 

z — 1 if z is odd 

Then a necessary and sufficient condition for D to be a search design, where 

at most one element of is nonnegligible, is that for every pair of distinct 
submatrices D 21 and D 22 there exists a row in D 2 (say, the ith row), such that 

0("/l) ♦ Hn i2 ) 

0Oi) 0(«2) 

[where i may be different for different pairs (D 21 , ^ 22 )]- 


This provides a convenient and simple check on D 2 for D to be an MEP 丄 
Whereas for the MEP.l discussed above the vector in (16.1) consists of 
all possible interactions in 2 n factorial, it seems more reasonable to restrict 
to include only 2 - and 3-factor interactions, such that = Q) + ( 3 ). A search 
design for identifying at most one nonnegligible effect among those interactions 
has been referred to by Gupta (1990) as a weak MEP.l. The structure of such an 
MEP.l has been considered by Ghosh (1981) and Ohnishi and Shirakura (1985), 
and we shall give a brief description of their results without going through all 
the derivations. 

Let D\ be given by its design matrix 

f ^ ) 

Di = 0；, (16.9) 

where D\ consists of N\ = n-\-2 treatment combinations and represents a main 
effects plan. Let denote the observation vector obtained from D\ and let 


Jl = ^1^1 + ^2^2 + e \ 


( 16 . 10 ) 
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with and as defined above. Using the parameterization (16.8) it follows 
that 


/i 3 ； \ 

A* { = 1 -3 r n (16.11) 

\^n — 21 n J 

Now the columns of in (16.9) and of A 2 in (16.1) are obtained by appropriate 
multiplication of the elements in the columns of A* (and A\). For example, to 
obtain the column corresponding to Fi x Fi 2 in A 2 (and in A 2 ) we simply multiply 
the corresponding elements of the columns belonging to Fi l and F/ 2 . Since D\ 
is a main effects plan, the columns of are linear combinations of the columns 
of A*. This fact can be exploited to obtain conditions for D 2 such that condition 
(16.3) of Theorem 16.1 is satisfied for 众 =1. This is done by considering all 
possible types of pairs of 2 - and/or 3-factor interactions such as (Fi“ 2 , Fi^), 
Fi 3 i 4 ), (Fi { i 2 , Fi { i 3 i 4 ), and so forth with distinct indices i\, h, h, 4 . This 
leads to the following theorem. 

Theorem 16.5 (Ohnishi and Shirakura, 1985) Let co(z f ) denote the weight 
of the vector z, which is defined as the number of nonzero elements in z. Further, 
let D\ be given by (16.9) with row vectors of weights 0, — 1, and n. Then a 

necessary and sufficient condition on D 2 with N: treatment combinations, such 
that Z) = Di + D 2 is a search design, is that there exist treatment combinations 
(x\,X2 ,..., x n ) in D 2 satisfying the following conditions for any distinct integers 
i k {k = 1, 2, 3, 4, 5, 6) belonging to the set {1,2,..., n}\ 


(1) 

0 ) {xi j, X12 ) = 1 

•^• 3=0 


⑵ 

0 ) {x，i j, X12 ) = 1 

⑺ (*^' 3 ， X/ 4 ) = 1 


(3a) 

(DiXi^X^) > 1 

⑺ ( 又 /3, 义“） = o 

or 

(3b) 

CO (^X[ j, X12 )—0 

0)(x i 3 ,x i4 ) > 1 


(4a) 

^i\ = 0, X12 — 1 

⑴ (*^' 3 ， X/ 4 ) = 1 

or 

(4b) 

OjiXi^X^) > 1 

⑺ ( 义 /3, 义 /4) = 0 

or 

(4c) 

o) {x，i j, X12 ) — 0 

⑺ ( 义 /3, 义 / 4 ) = 2 


(5a) 

o) {x，i j, X12 ) — 2 

文 /4) = 0 

or 

(5b) 

co {^Xi j, X12 ) — 0 

⑺ 0 ^ 3 , 义 / 4 ) = 2 

or 

(5c) 

⑺ (X “， xw /4 ) = 1 

*^•5 = 1 

or 

(5d) 

⑺ (x“w 2 , a 3 ， x /4 ) = 3 

•^• 5=0 
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( 6 a) 

co {xi j, X 12 )—0 

oj(x i3 ,Xi 4 ,Xi 5 ) >2 or 

( 6 b) 

> 1 

co(x i3 ,x i4 ,Xi 5 ) < 1 

(7a) 

(DiXi^Xi^Xi^) < 1 

^>(x i4 ,Xi 5 ,Xi 6 ) >2 or 

(7b) 

(oixi^Xi^Xi^) > 2 

co(x i4 ,x i5 ,x i6 ) < 1 


Even though the conditions in Theorem 16.5 are intended to provide a mech¬ 
anism for checking condition (16.3) for all possible submatrices A 20 , they them¬ 
selves are not very easy to check since the number of cases to be considered is 
quite large. The conditions prove, however, to be useful in a sequential construc¬ 
tion of search designs (see Section 16.4). 

16.3.3 Resolution V Plus One Plans 

In the context of fractional factorial designs resolution V designs are of particular 
importance as long as the assumption of negligible interactions involving three 
or more factors is reasonable. If we suspect, however, that at most k of those 
interactions are nonnegligible, then a search design may be appropriate. Again, 
the case k = \ is of interest. Such a search design is referred to as a resolution 
V.l design. 

For resolution V designs the parameter vector § j in (16.1) contains the mean, 
the main effects, and the 2 -factor interactions, with 

v\ = \ -\- n -\- 

components, and the vector of negligible effects, § 2 , consists of the remaining 
interactions with V 2 = 2 n — v\ terms. For purposes of a search design may be 
(i) as described above or (ii) restricted to involve only the 3-factor interactions, 
in which case V 2 = ( 3 ). 

The case (i) above was considered by Srivastava and Ghosh (1976, 1977), 
whereas case (ii) was considered by Shirakura and Ohnishi (1985). In both cases 
the starting points are balanced resolution V plans (see Section 13.6.5). The 
reason for the restriction to these balanced arrays is the fact that it reduces greatly 
the number of cases one has to consider in order to verify whether condition 
(16.4) is indeed met. Exploiting the form of balanced arrays in light of the rank 
condition (16.4) leads then to the identification of those resolution V designs that 
are also resolution V.l designs. We shall list a few examples in Section 16.4.2. 

16.3.4 Other Search Designs 

In the previous section we have considered search designs for 2 〃 factorials that 
allow us to search for one (k = 1 ) additional interaction among those interactions 
that are ordinarily assumed to be negligible. Extensions to k = 2 have been con¬ 
sidered using Theorem 16.1 [for resolution III.2 plans see, e.g.，Shirakura (1991) 
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and Chatterjee, Deng, and Lin (2001)]. Another extension is that to factors with 
more than two levels. Ghosh and Bums (2001), for example, consider search 
designs for the 3 n factorial, whereas Chatterjee and Mukerjee (1986, 1993) con¬ 
sider the most general case for symmetrical and asymmetrical factorials for A: = 1 
and k > 3, respectively. 

Another application of the principles of a search design is given by Ghosh 
and Bums (2002). They assume that in (16.1) contains only the mean and 
与 2 contains the main effects of the n factors. By assuming that at most k (< n) 
factors have nonnegligible effects, they are able to reduce the number of treatment 
combinations necessary to identify and estimate those effects compared to the 
number of treatment combinations in an orthogonal main effects plan. Note that 
this idea is similar to that used in supersaturated designs (see Section 15.1) except 
that the number of factors here is “small.” 


16.4 LISTING OF SEARCH DESIGNS 

There are not very many methods of constructing search designs. The existing 
methods fall essentially into three categories: 

1. Start with a “minimal” design D\ that allows estimating ^ j in (16.1) assum¬ 
ing that 夸 2 is negligible; then add treatment combinations in the form of 
D2 such that the final design D = D\ D2 satisfies the conditions of a 
search design. 

2. Start with a “larger” design D\ that allows estimating $ 丨 and show that D\ 
itself satisfies the conditions of a search design. 

3. Start with an “intuitive” design D\ and show that it satisfies the conditions 
of a search design, or modify it accordingly. 


Methods 1 and 2 have been used for resolution III.l and V.l, respectively, 
and we shall give some examples below. Method 3 is essentially a trial-and-error 
method and requires a great amount of intuition and insight. 

16.4.1 Resolution III.1 Designs 

Starting with the design D\ given in (16.9), Ghosh (1981) suggested to construct 
the design D 2 iteratively. More precisely, let D 2 , n be the design D 2 for n factors 
with N2, n treatment combinations, then D2, n -\-\ can be represented by the design 
matrix 


-^2,n+l = 

where I> 2,«+1 is a N 2 , n +\ x (n + 1) matrix, D\ n is a (A^+i _ ^2,n) x n 
matrix, and z n -\-\ = (z[ n+1 , z’ 2 行 +i)’ is an A^， w +i x 1 vector. All the elements 


,D2,n Zl ， ”+1 、 
\^2,n z 2,n+l / 


(16.12) 
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in Z> 2,«+1 are, of course, either 0 or 1. We note here that it is possible that 
N2 ， n+\ = N2 ， n. 

To proceed iteratively has the advantage that it greatly reduces the number 
of cases that have to be considered to check the conditions of Theorem 16.5. 
Using this method, Ohnishi and Shirakura (1985) have provided designs D 2 ， n 
for 3 < n < 8, which are given in Table 16.1 with a summary of the various 
model and design parameters as given in Table 16.2. It illustrates the point that 
the total number N of treatment combinations required is relatively small but 
increases proportionately, of course, as n, the number of factors, increases. 

16.4.2 Resolution V.l Designs 

In order to reduce the computational load of checking the rank condition (16.4), 
construction of resolution V.l designs has been limited to balanced arrays (see 
Section 13.6.5). Srivastava and Ghosh (1977) and Shirakura and Ohnishi (1985) 
provide tables of such search designs for n = 4, 5, 6, 7, 8, and we present some 
examples of their results in Table 16.3. 


Table 16.1 Design Matrices Z> 2 ,《 for 3 < w < 8 Factors 



Table 16.2 Number of Parameters and 
Runs for Search Design of Resolution III.l 


n 

vi 

^2 


n 2 

N 

3 

4 

4 

5 

2 

7 

4 

5 

10 

6 

5 

11 

5 

6 

20 

7 

5 

12 

6 

7 

35 

8 

8 

16 

7 

8 

56 

9 

8 

17 

8 

9 

84 

10 

10 

20 
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Table 16.3 Resolution V.l Designs for 4 < w < 8 Factors 


Srivastava and Ghosh (1977) Ohnishi and Shirakura (1985) 


n 


v»2 

N 





y 


V2 

N 




A/ 



4 

11 

5 

15 

1 

1 

1 

1 

0 










5 

16 

16 

21 

1 

1 

1 

0 

10 










6 

22 

42 

28 

1 

1 

1 

0 

01 

0 

20 

28 

1 

0 

0 

10 

1 

1 

7 

29 

99 

36 

1 

1 

1 

0 

00 

10 

35 

35 

0 

1 

0 

00 

1 

10 

8 

37 

219 

45 

1 

1 

1 

0 

00 

010 

56 

44 

0 

1 

0 

00 

0 

110 


Let S n j denote the set of all treatment combinations with exactly j factors 
at level 1 and n — j factors at level 0 , and let Xj denote the number of times 
that each treatment combination in S n j appears in the design (array) (j = 0, 1, 
… ， n). Of course, each vector (treatment combination) in S n j has weight j and 
\! = ( 入 o, 入1， •.. ， 入 n) is referred to as the index set of the array. The resolution 
V.l designs, which are also referred to as BARE 5.1 plans by Srivastava and 
Ghosh (1976), are then completely described by their index sets, as given in 
Table 16.3. 

The Srivastava-Ghosh designs contain 

N n = 1 + + 

runs and can be represented easily as D = S n ，Q U i U S n ^2 U ... U S n , n 一 [ Both 
types of search designs given in Table 16.3 are minimal designs, but designs 
with a larger number of runs are given in the two references. We also note that 
restricting §2 in (16.1) does not reduce the size of the design appreciably. 

16.5 ANALYSIS OF SEARCH EXPERIMENTS 
16.5.1 General Setup 

We shall call an experiment that employs a search design a search experiment. 
The analysis of data from a search experiment are based, of course, on submodels 
of (16.1), which we write as 

y = + -^21 (O? + e (16.13) 

where ^ refers to k possible components of i n (16.1) and refers to the 

column vectors in A 2 corresponding to The analysis will then be performed 
for each possible ^ from 与 2 . 

For purposes of characterizing and comparing the outcomes from the various 
analyses we shall denote by the true (unknown) nonnegligible effects in ^ 2 - 
We shall then refer to model (16.13) with ^ as MO, and with < # as 
Ml, where Ml represents a series of competing models to MO. 
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16.5.2 Noiseless Case 

Even though in practice there does not exist a situation where ^ = 0 in (16.3), we 
have pointed out earlier that this noiseless case is important for purposes of con¬ 
structing search designs. The noiseless case is equally important and instructive 
for describing and comparing the various analyses based on MO and Ml. 

We shall illustrate this for the case k = l by first constructing a data set with 
known parameters ^ j and and then using (16.13) with ^ = 0 to analyze those 
data. 


Example 16J Let us consider the case of n = 3 factors and the search design 
with D 2 given in Table 16.1. Thus, in model (16.1), 

/ \ 

AC 
=BC 

\abcJ 

Let [i = 10, A = 4, B = -2, C = 3, AB = 2, AC = 0, BC = 0, ABC = 0. 
Then using (16.8) [see also (7.30)] with A* = 2, = -1, C* = 1.5, (AB)* = 1 

we obtain the true yield 3 ； = a(x) for the seven treatment combinations of the 
design D as follows: 


Treatment 

Combination 

10 

A* 

2 

B* 

-1 

c* 

1.5 

1 

y 

111 

1 

-1 

-1 

-1 

1 

13.5 

000 

1 

-1 

-1 

-1 

1 

8.5 

Oil 

1 

-1 

1 

1 

-1 

7.5 

10 1 

1 

1 

-1 

1 

-1 

13.5 

110 

1 

1 

1 

-1 

1 

10.5 

1 00 

1 

1 

-1 

-1 

-1 

10.5 

00 1 

1 

-1 

-1 

1 

-1 

11.5 



The analysis of these data, using SAS PROC GLM, is presented in Tables 
16.4a-16.4c for the models with f = AB, AC, and BC, respectively. We shall 
comment briefly on some aspects of the output: 

1. Since AB = ( 0 ，the model used in Table 16.4 a provides a perfect fit to 
the data as indicated by the fact that SS(Error) = SSE(MO) = 0， which 
is equivalent to the fact that the estimated effects are equal to the true 
effects, for example, A = A, and that the predicted values y~ are equal to 
the observed values j. In other words, the true model is identified correctly. 
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Table 16.4 Data and Analysis for Search Experiment (Noiseless Case) 

options nodate pageno=l ； 
data search ； 
input A B C y ； 
datalines ； 

111 13.5 
000 8.5 
011 7.5 
101 13.5 
110 10.5 
100 10.5 
001 11.5 

run ； 

proc print data=search ； 
titlel 1 TABLE 16.4'; 

title2 'DATA FOR SEARCH EXPERIMENT'; 

title3 '(NOISELESS CASE)'; 

run ； 

proc glm data=search ； 
class A B C; 
model y=A B C A*B/e ； 
estimate 'A' A -1 1 ； 
estimate 1 B' B -1 1; 
estimate 'C' C -1 1 ； 

estimate 1 A*B 1 A*B 1 -1 -1 l/divisor=2 ； 
output out=predicta p=yhat r=resid stdr=eresid ； 
titlel 'TABLE 16.4a'; 

title2 'ANALYSIS WITH MODEL Y=A B C A*B'; 

title3 '(NOISELESS CASE)'; 

run ； 

proc print data=predicta ； 
run ； 

proc glm data=search ； 
class A B C ； 
model y=A B C A*C/e ； 
estimate 'A' A -1 1; 
estimate 'B' B -1 1 ； 
estimate 'C' C -1 1 ； 

estimate 'A*C' A*C 1 -1 -1 l/divisor=2 ； 
output out=predictb p=yhat r=resid stdr=eresid ； 
titlel 'TABLE 16.4b'; 
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Table 16.4 (Continued) 


title2 'ANALYSIS WITH MODEL ABC A*C'; 

title3 '(NOISELESS CASE)'; 

run ； 


proc print data=predictb ； 
run ； 


proc glm data=search ； 
class A B C ； 


model y=A B C B*C/e 
estimate 'A' A -1 1 
estimate 'B' B -1 1 
estimate 'C' C -1 1 


estimate 1 B*C' B*C -111 -l/divisor=2 ； 
output out=predictc p=yhat r=resid stdr=eresid ； 
titlel 'TABLE 16.4c'; 

title2 'ANALYSIS WITH MODEL Y=A B C B*C'; 

title3 '(NOISELESS CASE)'; 

run ； 


proc print data=predictc ； 
run ； 


Obs ABC 


y 


1 

2 

3 

4 

5 

6 
7 


1 1 
0 0 
0 1 
1 0 
1 1 
1 0 
0 0 


1 13.5 

0 8.5 

1 7.5 

1 13.5 

0 10.5 

0 10.5 

1 11.5 


A. ANALYSIS WITH MODEL Y=A B C A*B 
(NOISELESS CASE) 


The GLM Procedure 

Class Level Information 


Class 

A 

B 

C 


Levels 

2 

2 

2 


Values 

0 1 
0 1 
0 1 
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Table 16.4 (Continued) 


Number of observations Read 7 
Number of observations Used 7 

General Form of Estimable Functions 



Effect 

Coefficients 




Intercept 

LI 




A 

0 

L2 




A 

1 

L1-L2 




B 

0 

L4 




B 

1 

L1-L4 




C 

0 

L6 




C 

1 

L1-L6 




A*B 

0 0 

L8 




A*B 

0 1 

L2-L8 




A*B 

10 

L4-L8 




A*B 

11 

L1-L2-L4+L8 



Dependent 

Variable : y 

Sum of 




Source 

DF 

Squares 

Mean Square 

F Value 

Pr > F 

Model 

4 

31.42857143 

7.85714286 

Infty 

< . 0001 

Error 

2 

0.00000000 

0.00000000 



Corrected 

Total 6 

31.42857143 





R-Square 

Coeff Var 

Root MSE 

y Mean 



1.000000 

0 

0 10 

.78571 


Source 

DF 

Type I SS 

Mean Square 

F Value 

Pr > F 

A 

1 

13.76190476 

13.76190476 

Infty 

< . 0001 

B 

1 

1.66666667 

1.66666667 

Infty 

< • 0001 

C 

1 

10.00000000 

10.00000000 

Infty 

< . 0001 

A*B 

1 

6.00000000 

6.00000000 

Infty 

< . 0001 
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Table 16.4 (Continued) 


Source 



DF 

Type III SS 

Mean Square 

F Value 

Pr > F 

A 




1 

24.00000000 

24 . 

00000000 

Infty 

< . 0001 

B 




1 

6.00000000 

6 . 

00000000 

Infty 

< . 0001 

C 




1 

13.50000000 

13.50000000 

Infty 

< . 0001 

A*B 




1 

6.00000000 

6 . 

00000000 

Infty 

< . 0001 







Standard 





Parameter 



Estimate 

Error 

t 

Value Pr 

> Itl 


A 




4.00000000 


0 


Infty 

c . 0001 


B 




-2.00000000 


0 


-Infty 

c . 0001 


C 




3.00000000 


0 


Infty 

< . 0001 


A*B 




2.00000000 


0 


Infty 

< . 0001 


Obs 

A 

B 


C y 

yhat 


resid eresid 


1 

1 

1 


1 13.5 

13.5 



0 



2 

0 

0 


0 8.5 

8.5 



0 



3 

0 

1 


1 7.5 

7.5 

-2 

.6645E-15 



4 

1 

0 


1 13.5 

13.5 



0 



5 

1 

1 


0 10.5 

10.5 



0 



6 

1 

0 


0 10.5 

10.5 



0 



7 

0 

0 


1 11.5 

11.5 



0 






B. 

ANALYSIS WITH 

MODEL 

A B 

C A*C 







(NOISELESS CASE) 





The GLM Procedure 

General Form of Estimable Functions 


Effect 


Coefficients 

Intercept 

LI 

A 

0 

L2 

A 

1 

L1-L2 

B 

0 

L4 

B 

1 

L1-L4 

C 

0 

L6 

C 

1 

L1-L6 

A*C 

0 0 

L8 

A*C 

0 1 

L2-L8 

A*C 

1 0 

L6-L8 





ANALYSIS OF SEARCH EXPERIMENTS 


623 


Table 16.4 (Continued) 


Dependent Variable : y 


Sum of 


Source 



DF 

Squares 

Mean Square 

F 

Value 

Pr > F 

Model 



4 

26.09523810 

6.52380952 


2.45 

0.3106 

Error 



2 

5.33333333 

2.66666667 




Corrected 

Total 

6 

31.42857143 






R- 

Square 

Coeff Var 

Root MSE 

y 

Mean 



0 . 

830303 

15.14033 

1.632993 

10.78571 


Source 



DF 

Type I SS 

Mean Square 

F 

Value 

Pr > F 

A 



1 

13.76190476 

13.76190476 


5.16 

0.1511 

B 



1 

1.66666667 

1.66666667 


0.62 

0.5120 

C 



1 

10.00000000 

10.00000000 


3.75 

0.1924 

A*C 



1 

0.66666667 

0.66666667 


0.25 

0.6667 

Source 



DF 

Type III SS 

Mean Square 

F 

Value 

Pr > F 

A 



1 

16 .66666667 

16.66666667 


6.25 

0.1296 

B 



1 

2.66666667 

2.66666667 


1.00 

0.4226 

C 



1 

8.16666667 

8.16666667 


3.06 

0.2222 

A*C 



1 

0.66666667 

0.66666667 


0.25 

0.6667 






Standard 




Parameter 



Estimate 

Error t Value Pr 

> |t| 

A 




3.33333333 

1.33333333 

2.50 0 

.1296 

B 




-1.33333333 

1.33333333 

-1.00 0 

.4226 

C 




2.33333333 

1.33333333 

1.75 0 

.2222 

A*C 




0.66666667 

1.33333333 

0.50 0 

.6667 

Obs 

A 

B 


C y 

yhat 

resid 

eresid 

1 

1 

1 


1 13.5 

12.8333 0. 

66667 

0.94281 

2 

0 

0 


0 8.5 

8.5000 0. 

00000 



3 

0 

1 


1 7.5 

8.8333 -1. 

33333 

0.94281 

4 

1 

0 


1 13.5 

14.1667 -0. 

66667 

0.94281 

5 

1 

1 


0 10.5 

9.8333 0. 

66667 

0.94281 

6 

1 

0 


0 10.5 

11.1667 -0. 

66667 

0.94281 

7 

0 

0 


1 11.5 

10.1667 1.: 

33333 

0.94281 
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Table 16.4 (Continued) 

C. ANALYSIS WITH MODEL Y=A B C B*C 
(NOISELESS CASE) 

The GLM Procedure 

General Form of Estimable Functions 



Effect 

Coefficients 




Intercept 

LI 




A 0 

L2 




A 1 

L1-L2 




B 0 

L4 




B 1 

L1-L4 




C 0 

L6 




C 1 

L1-L6 




B*C 0 0 

L8 




B*C 0 1 

L4-L8 




B*C 1 0 

L6-L8 




B*C 1 1 

L1-L4-L6+L8 


Dependent 

Variable : y 

Sum of 



Source 

DF 

Squares 

Mean Square 

F Value 

Model 

4 

26.09523810 

6.52380952 

2.45 

Error 

2 

5.33333333 

2.66666667 


Corrected 

Total 6 

31.42857143 




R-Square 

Coeff Var 

Root MSE 

y Mean 


0.830303 

15.14033 

1.632993 10 

.78571 

Source 

DF 

Type I SS 

Mean Square 

F Value 

A 

1 

13.76190476 

13.76190476 

5.16 

B 

1 

1.66666667 

1.66666667 

0.62 

C 

1 

10.00000000 

10.00000000 

3.75 

B*C 

1 

0.66666667 

0.66666667 

0.25 


Pr > F 

0.3106 


Pr > F 

0.1511 
0.5120 
0.1924 
0.6667 
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Table 16.4 

{Continued) 





Source 

DF 

Type III SS 

Mean Square 

F Value 

Pr > F 

A 

1 

16.66666667 

16.66666667 

6.25 

0.1296 

B 

1 

2.66666667 

2.66666667 

1.00 

0.4226 

C 

1 

8.16666667 

8.16666667 

3.06 

0.2222 

B*C 

1 

0.66666667 

0.66666667 

0.25 

0.6667 






Standard 




Parameter 


Estimate 

Error 

t Value 

Pr > |t 丨 

A 



3.33333333 

1.33333333 

2.50 


0.1296 

B 



-1.33333333 

1.33333333 

-1.00 


0.4226 

C 



2.33333333 

1.33333333 

1.75 


0.2222 

B*C 



0.66666667 

1.33333333 

0.50 


0.6667 

Obs 

A 

B 

C y 

yhat 

resid 


eresid 

1 

1 

1 

1 13.5 

12.1667 

1.33333 

0 

. 94281 

2 

0 

0 

0 8.5 

7.8333 

0.66667 

0 

. 94281 

3 

0 

1 

1 7.5 

8.8333 

-1.33333 

0 

. 94281 

4 

1 

0 

1 13.5 

14.1667 

-0.66667 

0 

. 94281 

5 

1 

1 

0 10.5 

10.5000 

0.00000 



6 

1 

0 

0 10.5 

11.1667 

-0.66667 

0 

. 94281 

7 

0 

0 

1 11.5 

10.8333 

0.66667 

0 

. 94281 


2. For the models with f = AC and BC, Tables 16.4b and 16.4c show 
that SS(Error)=SSE(Ml)>0, as they should because the wrong models 
are used. 

3. In all three cases the expressions for estimable functions confirm the the¬ 
oretical results that, individually, AB, AC, and BC are estimable. 

4. It is, of course, clear that ABC cannot be estimated since its definition 

depends on all eight treatment combinations of the 2 3 factorial. □ 

16.5.3 Noisy Case 

As we have seen in the previous section, in the noiseless case the true model 
is identified correctly. This may not necessarily happen in the noisy case, that 
is, when var ⑷ =V ^ 0. However, based on the results for the noiseless case, 
where SSE(M0)=0, it seems plausible to select that model as the correct model 
for which SSE achieves the smallest value among competing models. This, in 
fact, was suggested by Srivastava (1975). 

We illustrate this procedure in the following example. 

Example 16.2 We consider the same search design as in Example 16.1. We also 
use the same parameter values as in Example 16.1. The observations are obtained 
by adding N(0, 1) noise to each true value. The data are given in Table 16.5. 
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Table 16.5 Data and Analysis for Search Experiment (With N(0,1) Noise) 


Obs ABC 


y 


1 11 

2 0 0 

3 0 1 

4 10 

5 11 

6 10 

7 0 0 


1 13.792 
0 9.013 
1 7.445 
1 15.187 
0 10.176 
0 10.606 
1 11.638 


A. ANALYSIS WITH MODEL Y=A B C A*B 
(WITH N(0, 1) NOISE) 


The GLM Procedure 
Class Level Information 


Class 

A 

B 

C 


Levels 

2 

2 

2 


Values 

0 1 
0 1 
0 1 


Number of observations 7 


Dependent Variable : y 


Sum of 


Source 

DF 

Squares 

Mean Square 

F Value 

Pr > F 

Model 

4 

42.09222138 

10.52305535 

22.00 

0.0439 

Error 

2 

0.95654033 

0.47827017 



Corrected Total 6 

43.04876171 





R-Square 

Coeff Var 

Root MSE 

y Mean 



0.977780 

6.217804 

0.691571 11 

.12243 


Source 

DF 

Type I SS 

Mean Square 

F Value 

Pr > F 

A 

1 

16.20876430 

16.20876430 

33.89 

0.0283 

B 

1 

4.81496682 

4.81496682 

10.07 

0.0866 

C 

1 

15.73393923 

15.73393923 

32.90 

0.0291 

A*B 

1 

5.33455104 

5.33455104 

11.15 

0.0792 
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Table 16.5 (Continued) 


Source 

DF 

Type III SS 

Mean Square 

FValue 

Pr > F 

A 

1 

29.79504504 

29.79504504 

62.30 

0.0157 

B 

1 

11.74600417 

11.74600417 

24.56 

0.0384 

C 

1 

19.51928067 

19.51928067 

40.81 

0.0236 

A*B 

1 

5.33455104 

5.33455104 

11.15 

0.0792 




Standard 



Parameter 


Estimate 

Error 

tValue 

Pr > 111 

A 


4.45683333 

0.56466519 

7.89 

0.0157 

B 


-2.79833333 

0.56466519 

-4.96 

0.0384 

C 


3.60733333 

0.56466519 

6.39 

0.0236 

A*B 


1.88583333 

0.56466519 

3.34 

0.0792 


Obs 

A 

B 

c 

y 

yhat 

resid 

eresid 

1 

1 

1 

1 

13.792 

13.7877 

0.00433 

0.39928 

2 

0 

0 

0 

9.013 

8.5218 

0.49117 

0.39928 

3 

0 

1 

1 

7.445 

7.4450 

-0.00000 


4 

1 

0 

1 

15.187 

14.7002 

0.48683 

0.39928 

5 

1 

1 

0 

10.176 

10.1803 

-0.00433 

0.39928 

6 

1 

0 

0 

10.606 

11.0928 

-0.48683 

0.39928 

7 

0 

0 

1 

11.638 

12.1292 

-0.49117 

0.39928 




B. 

ANALYSIS WITH MODEL Y=A B C A*C 



(WITH N(0, 1) NOISE) 

The GLM Procedure 

Class Level Information 

Class Levels Values 

A 2 0 1 

B 2 0 1 

C 2 0 1 

Number of observations 7 

Dependent Variable : y 

Sum of 

Source DF Squares Mean Square F Value Pr > F 


Model 4 39.22872871 9.80718218 5.13 0.1696 

Error 2 3.82003300 1.91001650 

Corrected Total 6 43.04876171 
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Table 16.5 (Continued) 




R-Square 

Coef f Var 

Root MSE 


y Mean 



0.911263 

12.42564 

1.382033 


11.12243 

Source 


DF 

Type I SS 

Mean Square 

F Value Pr > F 

A 



1 

16.20876430 

16.20876430 

8.49 0.1004 

B 



1 

4.81496682 

4.81496682 

2.52 0.2533 

C 



1 

15.73393923 

15.73393923 

8.24 0.1030 

A*C 



1 

2.47105837 

2.47105837 

1.29 0.3733 

Source 


DF 

Type III SS 

Mean Square 

F Value Pr > F 

A 



1 

20.14284038 

20.14284038 

10.55 0.0832 

B 



1 

6.03605400 

6.03605400 

3.16 0.2174 

C 



1 

11.88633750 

11.88633750 

6.22 0.1301 

A*C 



1 

2.47105837 

2.47105837 

1.29 0.3733 






Standard 




Parameter 


Estimate 

Error 

t Value Pr > 111 


A 



3.66450000 

1.12842560 


3.25 0.0832 


B 


- 

2.00600000 

1.12842560 


-1.78 0.2174 


C 



2.81500000 

1.12842560 


2.49 0.1301 


A*C 



1.28350000 

1.12842560 


1.14 0.3733 


Obs 

A B 

C 

y 

yhat 

resid eresid 


1 

1 1 

1 

13.792 

13.4865 

0 

3055 0.79792 


2 

0 0 

0 

9.013 

9.0130 

0 

0000 . 


3 

0 1 

1 

7.445 

8.5385 

-1 

0935 0.79792 


4 

1 0 

1 

15.187 

15.4925 

-0 

3055 0.79792 


5 

1 1 

0 

10.176 

9.3880 

0 

7880 0.79792 


6 

1 0 

0 

10.606 

11.3940 

-0 

7880 0.79792 


7 

0 0 

1 

11.638 

10.5445 

1 

0935 0.79792 


C. ANALYSIS WITH MODEL Y=A B C B*C 
(WITH N(0, 1) NOISE) 

The GLM Procedure 

Class Level Information 

Class Levels Values 

A 2 0 1 

B 2 0 1 

C 2 0 1 
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Table 16.5 (Continued) 


Number of observations 7 


Dependent Variable : y 




Sum of 




Source 

DF 

Squares 

Mean Square 

FValue 

Pr > F 

Model 

4 

37.33955238 

9.33488810 

3.27 

0.2477 

Error 

2 

5.70920933 

2.85460467 



Corrected Total 6 

43.04876171 





R-Square 

Coef f Var 

Root MSE 

y Mean 



0.867378 

15.19055 

1.689558 11 

.12243 


Source 

DF 

Type I SS 

Mean Square 

FValue 

Pr > F 

A 

1 

16.20876430 

16.20876430 

5.68 

0.1400 

B 

1 

4.81496682 

4.81496682 

1.69 

0.3236 

C 

1 

15.73393923 

15.73393923 

5.51 

0.1434 

B*C 

1 

0.58188204 

0.58188204 

0.20 

0.6959 

Source 

DF 

Type III SS 

Mean Square 

FValue 

Pr > F 

A 

1 

21.99952017 

21.99952017 

7.71 

0.1090 

B 

1 

7.07094704 

7.07094704 

2.48 

0.2562 

C 

1 

13.32209004 

13.32209004 

4.67 

0.1633 

B*C 

1 

0.58188204 

0.58188204 

0.20 

0.6959 




Standard 



Parameter 


Estimate 

Error t Value 

Pr > 111 

A 


3.82966667 

1.37951795 

2.78 

0.1090 

B 

- 

2.17116667 

1.37951795 

-1.57 

0.2562 

C 


2.98016667 

1.37951795 

2.16 

0.1633 

B*C 


0.62283333 

1.37951795 

0.45 

0.6959 


Obs 

A 

B 

c 

y 

yhat 

resid 

eresid 

1 

1 

1 

1 

13.792 

12.5333 

1.25867 

0.97547 

2 

0 

0 

0 

9.013 

7.8947 

1.11833 

0.97547 

3 

0 

1 

1 

7.445 

8.7037 

-1.25867 

0.97547 

4 

1 

0 

1 

15.187 

15.3273 

-0.14033 

0.97547 

5 

1 

1 

0 

10.176 

10.1760 

0.00000 


6 

1 

0 

0 

10.606 

11.7243 

-1.11833 

0.97547 

7 

0 

0 

1 

11.638 

11.4977 

0.14033 

0.97547 
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The individual analyses for different models are given in Tables 16.5a-16.5c. The 
summary below shows that in this case we would have selected the true model over 
the others: 



Model 

SS(Error) 


M0: 

A, B, C, AB 

0.478 


Ml: 

A, 5, C, AC 

1.910 


Ml: 

A, B ， C, BC 

2.855 

□ 


16.6 SEARCH PROBABILITIES 

The discussion of the preceding section can be expressed in terms of probabilities 
as follows: For the noiseless case we have 

P[SSE(M0) < SSE(Ml) I MO, Ml, V = 0] = 1 

and for the noisy case we have 

P[SSE(M0) < SSE(Ml) I MO, Ml, F ^0] < 1 (16.14) 

Let us assume, for operational purposes, that var(^) = V = l. If = oo, 
then MO and Ml are indistinguishable, and hence we can rewrite (16.14) more 
precisely as 

P[SSE(M0) < SSE(Ml) I MO, Ml, of = oo] = 士 

The probability P[SSE(M0) < SSEM1) | MO, Ml, cr^] is called a search prob¬ 
ability, which assumes values between ^ and 1 (Ghosh and Teschmacher, 2002). 

The expression in (16.14) can be written out more explicitly as follows. To 
simplify the notation, let us write 

A^) = [A u A 2 im M(^) = AiO'A^) 

Then the error sum of squares, SSE, for model (16.13), which we shall denote 
more precisely by s(<) 2 , can be written as 

s(0 2 = y'U - Q«)]y (16.15) 

where 

(see Section 1.4.4.3). Rather than using (16.15) in (16.14) and minimizing ^(^) 2 , 
Shirakura, Takahashi, and Srivastava (1996) propose maximizing 



, y) = y f [QiO - Q]y 
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where 

e = 

and 

s «) 2 = y'(i - Q)y-H^,y) 

They then define 

P = min min P[h(^ 0 , j) > h(^, j)] (16.16) 

〈0 C 爸 2 S e O 

as the search probability for a given search design, where A(^ z ; denotes the 

set of all possible ^ of § 2 ， of which at least one effect is not of < 0 , and 

represents the vector of the nonnegligible effects in 夸 2 . 

For the case k = 1 and assuming that e 〜 _/V(0, lo^\ Shirakura, Takahashi, 
and Srivastava (1996) derive an explicit expression for 

PK ， cr》) = P[h^ 0 , y) ~ y)] (16.17) 

after rewriting (16.13) as 

y = ^ 1^1 + + ^ ( 16 . 18 ) 

where ^ is one effect of and a(^) is the column vector in A 2 corresponding 

to f, and fo represents the nonnegligible effect in 与 2 . The expression for (16.17) 
is then given as follows (see Ghosh and Teschmacher, 2002): 

尸 (fo, l 〜 2 ) = .5 + 2[0(pci«o, O) — .5] 


X [4>(p^/r(^0, ^ 0 ) - Cj(fo, O) - -5] (16.19) 

where 0(-) is the standard normal cdf, r(y, ij) = a\y)[I — A\(A\A\)~ l A\]a(r]) 
for (y,")= ( 如， f 0 ), (f 0 , OU) ， 吨 0 , O = Kfo, ？ o)f ) ， 

O = [ 厂 (Co, <o)/2][l - I <) I]，and p = Uo IAv 

It follows immediately from (16.19) that .5 < ap) < 1, as stated 

before, and that P(fo, ^ a e) increases as p increases. 

Using the search design and parameters in Example 16.2 we illustrate the use 
of (16.19) in the next example. 

Example 16.3 From Example 16.1 we obtain for use in (16.18) 

a(^)=a\AB) = (l 1-1-1 1-1 1) 

a(i ； x)=a\AC) = (l 1-1 1-1 -1 -1) 
a^ 2 ) = a\BC) = (1 1 1-1-1 1 -1) 
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and 


/ 1 1 
1 -1 
1 -1 
Ai = 1 1 

1 1 
1 1 
V 1 -1 


1 1 \ 

-1 -1 

1 1 

-1 1 

1 -1 

-1 -1 

-11 / 


with r(fo, fo) = 6, r(fo, fi) = 3, r(fo, ？ 2 ) = — 1 ， Kfi ，？ 1 ) = 1.5, 吻， < 2 ) = 5.5, 
又 (? 0 , fi) = 1 ， x(fo, ？ 2 ) = -0.1741 ， Cj(fo, <i) = o, Cj«o, ? 2 ) = 2.4777, p = 1. 
We then obtain P(fo, <i, 1) = .5 ， P«o, < 2 , 1) = .9144. Individually, the results 
show that there is a much higher probability of distinguishing between the true 
model (including AB) and the model including BC. If we adopt (16.16) as the 
overall measure for the given design and parameters, then the search probability 
P is only .5. □ 


Ghosh and Teschmacher (2002) show how search probabilities can be used to 
compare different search designs. They develop criteria so that such comparisons 
can be made without requiring a specific value of p = | (o \/cr e . 



CHAPTER 17 


Robust-Design Experiments 


As we have mentioned before, factorial designs in their various forms were orig¬ 
inally developed and used in the context of agricultural experimentation. They 
have since found wide application in almost all fields of empirical research. 
Important contributions to factorial experimentation has come from applications 
in engineering and manufacturing, a development usually associated with the 
name Taguchi. The so-called Taguchi method and variations of this method 
are concerned with quality engineering and product improvement. According 
to Taguchi (1993, p. 4)，“the objective of quality engineering is to discover the 
countermeasure which results in an overall reduction in the total loss to both 
the company and society. It is therefore important to evaluate the two kinds of 
losses — the loss due to function and one due to harmful effects.” Thus, the basic 
philosophy here rests on the concept of societal loss due to imperfect production 
and “environmental” factors and the role that experimental design can play to 
reduce that loss. We shall discuss some of the underlying ideas in the following 
sections. 


17.1 OFF-LINE QUALITY CONTROL 

An important aspect of any production process is quality control. Typically this 
is performed during the manufacturing phase of the process. Various forms of 
inspection sampling can be applied to ensure that the product meets certain spec¬ 
ification and that the process is in control (see, e.g., Montgomery, 1991). There 
exists, however, another form of quality control. It is conducted at the product or 
process design stage, thus preceding the manufacturing stage. This type of quality 
control is therefore referred to as off-line quality control. Its aim is to improve 
product manufacturability and reliability, and to reduce product development and 
lifetime costs (Kackar, 1985). 


Design and Analysis of Experiments. Volume 2: Advanced Experimental Design 

By Klaus Hinkelmann and Oscar Kempthorne 

ISBN 0-471-55177-5 Copyright © 2005 John Wiley & Sons, Inc. 


633 



634 


ROBUST-DESIGN EXPERIMENTS 


It is in this context that Taguchi (1986 ， 1987) introduced the application of 
what he called robust parameter design (parameter here refers to the characteristic 
of a product). According to Taguchi (1986, p. 98) its purpose “is to adjust the 
parameter levels so that the objective characteristic will not vary much even if 
the system and environmental parameters change. It is a search for the parameter 
levels at which the characteristic is stable.” More explicitly, Montgomery (1999) 
reformulates this in terms of making (1) products and processes insensitive to 
environmental factors or other factors hard to control and to variation transmitted 
from components and (2) finding levels of the process variables that achieve a 
mean performance at a specified (target) value and at the same time reducing 
variability around that value. 


17.2 DESIGN AND NOISE FACTORS 

With the previous statements we have implicitly acknowledged the existence of 
two categories of factors that affect the characteristic of the product: Design 
factors and noise factors. This categorization is a new feature that we have not 
encountered before, at least not in this explicit form. It is, however, an important 
feature in conducting and analyzing robust-design experiments. 

Design factors, sometimes also referred to as design parameters (Kackar, 1985) 
or control factors (Taguchi, 1993)，are those factors that can be controlled easily 
during the manufacturing process, the values of which remain essentially fixed 
at “optimal” levels. Noise factors, on the other hand, are factors that are hard 
to control, such as environmental factors, and, as a consequence, their levels 
will vary within a certain range during the production process and/or during 
subsequent use of the product [for a detailed description of various types of 
noise factors see, e.g.，Wu and Hamada (2000)]. 

As an example we mention an experiment performed in a semiconductor 
factory to determine the influence of five factors on the transistor gain for a 
particular device (Montgomery, 1999). The three factors implant dose, drive- 
in time ， and vacuum level are easy to control and hence represent the design 
factors. The factors oxide thickness and temperature are hard to control during 
the manufacturing process and hence represent the noise factors. 

The underlying idea of the Taguchi method is to incorporate the noise factors 
into the product design phase by varying their settings systematically over a range 
representative of the variation during typical manufacturing and/or use conditions. 
The aim then is to find settings for the design factors so that the influence of the 
noise factors on the product performance is minimized or, to express it differently, 
to make the product robust with respect to the noise factors. It is for this reason 
that this methodology is referred to as robust design or robust parameter design 
(or parameter design for short). 
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17.3 MEASURING LOSS 

As mentioned earlier, the notion of loss to society and to the company forms the 
underpinning of the Taguchi method. To assess the loss we need to specify a loss 
function as a function of a value of the performance characteristic of the product, 
say Y, and the target value, say r, for that performance characteristic. We can 
think of many loss functions, but in practice it is usually difficult to come up 
with the actual loss function. However, in many situations an appropriate loss 
function is given by 


£(Y) = c(Y - r) 2 


(17.1) 


where c is some constant, which can be determined if i{Y) is known for some 
value of Y. 

One way to determine c in (17.1) was described by Kackar (1985). Let (r — A, 
r + A) be the consumer’s tolerance interval in the sense that if Y falls outside 
this interval, then the product needs either to be repaired or discarded. Suppose 
the cost for that action is A dollars. It follows then from (17.1) that A = cA 2 
and hence c = A/A 2 . 

The loss function (17.1) is symmetric around r. This may not always be 
appropriate as the loss may depend on whether Y < z or Y > r. In this case a 
simple variation of (17.1) may lead to the form 


^(Y) = 


ci(F-r) 2 
C2(Y - t) 2 


if F < r 
if F > r 


(17.2) 


where ci and C 2 are constants that can be determined using arguments similar to 
those given above. 

A special case of (17.2) occurs if r = 0. Then (17.2) reduces to 

£(Y) = c 2 Y 2 if 7 > 0 (17.3) 


In either of the cases (17.1), (17.2), (17.3) we consider the expected loss 

L = E[l(Y)] (17.4) 


as the criterion to assess the quality of a product or process. The expectation in 
(17.4) is taken with respect of the distribution of Y during the product’s lifetime 
and across different users. The latter is represented to a great extent by some or 
all of the noise factors. 
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17.4 ROBUST-DESIGN EXPERIMENTS 

In order to minimize the average loss (17.4) we perform an experiment, the 
robust-design experiment. The purpose of this experiment is to determine the 
optimal levels for the control factors in conjunction with reducing the product’s 
performance around the target value. There are several ways in which such an 
experiment can be performed making use of some of the factorial designs we 
have discussed in previous chapters or certain types of response surface designs 
(see 1.12.3 and 1.12.4). 

17.4.1 Kronecker Product Arrays 

Distinguishing between the two sets of factors, design and noise factors, suggests 
the use of the following experimental design: Choose an appropriate fractional 
factorial design for the design factors and another one for the noise factors and 
then combine these two designs in the form of a Kronecker product design; 
that is, each design factor level combination occurs with each noise factor level 
combination. Taguchi (1986) referred to these two component designs as inner 
array and outer array, respectively. Moreover, he suggested the use of main 
effect plans in the form of orthogonal arrays. 

As an illustration consider the following example. 


Example 17.1 Suppose we have four design factors A\, A 2 , A 3 , A 4 , each 
at three levels, and three noise factors, B\, B 2 , B 3 , each at two levels. The inner 
array is then given by OA(9, 4, 3, 2; 1) and the outer array by OA(4, 3, 2, 2; 1) 
(see Example 13.9). Then the Kronecker product design or the so-called cross 
array consists of 9 x 4 = 36 runs. □ 


An important aspect of the cross array is that the interactions between indi¬ 
vidual design and noise factors can be estimated (see Example 13.9). If such 
interactions exist, they can be exploited to choose the design factor settings such 
that the product variation over the noise factor settings will be reduced (we shall 
return to this point in Section 17.5). 

On the other hand, the number of runs required for a cross array may be quite 
large, even when we use minimal designs for both the design and noise factors. 
This disadvantage of cross arrays will become even more pronounced if we allow 
at least some 2-factor interactions between design factors by using a resolution 
IV or resolution V design instead of a main effect plan. 

17.4.2 Single Arrays 

The last two comments above provide a strong argument to look for alternative 
experimental designs in connection with robust-design experiments. This can be 
achieved by combining the design and noise factors into one set of factors and 
then obtain an appropriate fractional factorial design that achieves the objectives 
of the experiment. Such experimental designs are referred to as single array or 
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common array. We shall illustrate this approach with some examples. For a listing 
of useful single arrays see Wu and Hamada (2000). 

Example 17.2 Suppose we have three design factors A\, A2, A3 and two 
noise factors B\, B 2 , each at two levels. The usual Taguchi method would cross 
the array with the 2 2 array with 4x4=16 runs. Suppose, however, that 
we are interested also in estimating 2 -factor interactions between design factors. 
In that case the crossed array would be of the form 2 3 x 2 2 with 8 x 4 = 32 
runs. A single array can be obtained by considering the 2y _1 array based on the 
identity relationship I = AiA 2 A 3 B 1 B 2 with only 16 runs. □ 

Example 17.3 Suppose we have five design factors A\, A2, A3, A4, A5 and 
two noise factors B\, B 2 , each at two levels. The usual cross array would be of 
the form 2j £ 2 x 2 2 with 8 x 4 = 32 runs. The single array 2jy 2 , also with 32 
runs, based on the identity relationship 

I = A1A2A3A5 = A1A2A4B1B2 = A^A^A^,B\B2 

has the additional property that it allows us to estimate at least some of the 
2 -factor interactions among design factors, assuming that only interactions involv¬ 
ing three or more factors are negligible. □ 

Example 17.4 Suppose we have three design factors A\, A2, A3, each at 
three levels, and two noise factors, Bi, B 2 , each at two levels. The usual cross 
array would be of the form 3^ x 2 2 with 9 x 4 = 36 runs. Here the 3 ^， 
referred to [ 9 ( 3 3 ) by Taguchi, is based on / = A1A2A3. An alternative single 
array in the form of a modified central composite design for five factors (see 
1.12.4.3) was proposed by Myers, Khuri, and Vining (1992). Their design consists 
of three parts: (1) The factorial part is a 2y _1 based on / = Ai A 2 A 3 B 1 B 2 and 
using only the high and low levels of each factor; ( 2 ) the axial part consists of the 
axial points for the control factors, keeping the noise factors at their intermediate 
level (assuming that the noise factors are quantitative factors), and (3) no center 
runs. Using the response surface methodology convention of denoting the low, 
intermediate and high levels by — 1 , 0 ， 1 , respectively, the final design is given 
in Table 17.1. 

This design, consisting of 16 + 6 + no = 22 + no runs, allows us to estimate 
all main effects, all 2-factor interactions between design factors, and the 2-factor 
interactions between design and noise factors. □ 

The single arrays discussed above provide some insight into their general 
structure. It also shows that single arrays may be more economical than crossed 
arrays or that they provide more information. These are aspects that may deter¬ 
mine which type of array we should choose for a given situation. However, to 
some extent this choice will also be influenced by the type of modeling we intend 
to use for data from such experiments, that is, how we intend to use the data in 
order to arrive at a robust product design. We shall discuss some of these issues 
in the following sections. 
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Table 17.1 Central Composite Design for Three 
Design Factors and Two Noise Factors 


M ^2 A3 B\ B2 


1 

1 

1 

1 

1 

1 






1 

1 


1 


一 1 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

-1 

0 

0 

0 

0 

1 

0 

0 

0 

0 

0 

-1 

0 

0 

0 

0 

1 

0 

0 


0 0 0 0 0 no 


17.5 MODELING OF DATA 

Just as there are different approaches to deciding on a statistical design for a 
robust-design experiment, there are different ways of analyzing the data from 
such an experiment. The proposal by Taguchi (1986) to use various forms of 
signal-to-noise ratios (S/N ratios) to analyze and interpret such data has come 
under a certain amount of criticism (e.g., Box, 1988) and as a consequence other 
methods of looking at the data from these experiments have been proposed. We 
shall indicate some of these ideas in the following sections. 

17.5.1 Location and Dispersion Modeling 

As we have mentioned earlier, the novel idea associated with robust-design exper¬ 
iments is not only to identify the factors that affect the average response, but 
also to identify the factors that contribute to the variability of the response. In 
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the practical setting of product design this means more specifically that we need 
to model both the location and the dispersion of the response or, as it is often 
referred to, the performance characteristic in order to find an appropriate level 
combination of the control factors that will achieve the desired value of or target 
for the performance characteristic. 

Let us consider the cross array of Section 17.4.1. Suppose the inner or design 
array D\ consists of N\ treatment combinations and the outer or noise array D 2 
consists of N 2 treatment combinations. The total number of runs in D = D\ x D 2 
then is N = N\N 2 . Let A denote the design factor settings, then, following 
Nair (1986), we can write a location-scale model for the (possibly transformed) 
observations as 


Y = fM(A)-\-cr(A)e (17.5) 

where the random component e does not depend on A and is assumed to have 
mean 0 and variance 1. The terms /x(A) and a {A) represent the location and 
dispersion effects, respectively. 

We should emphasize that using model (17.5) implies that the experiment has 
been carried out according to a completely randomized design. In practice this 
may not always be possible and some form of blocking may have to be used as 
discussed in Chapters 11 and 12. Another possibility is the use of a split-plot- 
type design (see 1.13.4), where, for example, the treatment combinations of the 
design array represent the whole-plot treatments and the treatment combinations 
of the noise array represent the split-plot treatments. In either case it is easy to 
modify model (17.5) by either incorporating the block effects or the whole-plot 
error term, both of which are independent of the design factors. 

Using the loss function (17.1) the expected loss (17.4) for the response (17.5) 
can be written as 


L = E[l(Y)] = c E(Y - r) 2 
=c[fi(A) — r] 2 + ca 2 (A) (17.6) 

This form of L shows explicitly that in order to minimize the mean-squared 
error (17.6) we need to identify the setting of the design factors such that /x(A) 
achieves the target value r and <t 2 (A) is minimized. 

Now let A = (A\, A 2 ,..., A n ) be the set of design factors employed in the 
experiment. Then the aim of the subsequent analysis is to divide A into three 
sets, say, A = A 2 , A 3 ) (Phadke, 1982), where 

A\ represents the factors that affect dispersion (they are referred to as control 
factors); 

A 2 represents the factors that affect location but not dispersion (they are 
referred to as adjustment factors); 
represents the factors that neither affect location nor dispersion. 
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According to this division the factors included in A\ and A 2 represent the 
actual design factors (Box, 1988). Such a division is possible only if we can get 
rid of dispersion effects that arise because of a dependence of a on /z (Box and 
Meyer, 1986b; Box, 1988). This can often be achieved through an appropriate 
transformation of the original data or through the use of a performance mea¬ 
sure independent of adjustment (PerMIA) as proposed by Leon, Shoemaker, and 
Kackar (1987). We assume that Y in (17.5) is of that form. 

To achieve the division of A let us now consider the observations Y from the 
experiment based on the cross array D. More specifically, let ytj be the (possibly 
transformed) observation for the /th treatment combination of D\ and the jth 
treatment combination of D 2 (i = 1,2,N \； j = 1,2,..., A^). In some sense 
the N 2 observations, stemming from D 2 , for each treatment combination of D\ 
can be considered as replications for that treatment combination. It is, therefore, 
plausible to analyze 


l n 2 1 n 2 

y-- = Y 2 ^ yij and = 

;'=i 7=1 

for i = 1 , 2 ,, A/\, separately as measures of location and variability, respec¬ 
tively. The analysis of the y L conforms, of course, to the usual analysis of 
fractional factorial designs. The sf are obtained over the same design points of 
D 2 and, hence, any differences among them can be attributed to the design fac¬ 
tors. This provides justification for analyzing the sf, or more appropriately log sf, 
in the same way as the y L in order to identify the factors belonging to A\ and 
A 2 . For both variables we can use the tools of the analysis of variance. 

We shall illustrate this approach with the data obtained from a cross-array 
experiment described by Engel (1992). 


Example 17.5 (Engel, 1992) Seven design factors and three noise factors, 
each at two levels, were included in the experiment in an effort to identify the 
design factors that had an effect on the mean of and variation in the percentage 
of shrinkage of products made by injection moulding. The factors are (rather 
than using the labels Ai, A 2 ,..., A 7 and B\, B 2 , we retain the labels used 
by Engel): 


Design Factors Noise Factors 


A: cycle time 
B: mould temperature 
C: cavity thickness 
D: holding pressure 
E: injection speed 
F: holding time 
G: gate size 


M: percentage regrind 
N: moisture content 
O: ambient temperature 



MODELING OF DATA 


641 


The cross-array D is given by crossing D\ = with D 2 = 2^ 1 , were D\ 
has the identity relationship 

I = ABC = DEFG = ABCDEFG = ADE = BCDE 
=AFG = BCFG = BEG = ACEG = BDF = ACDF 

and D 2 has the identity relationship 

I =MNO 


The 32 design points with their observations are listed in Table 17.2 together 
with the values for sf (note that as a matter of convenience each sf value is 
listed four times for the four replications of each design point in D\). The results 
of the analysis of variance for the y t and the log sf are given in Table 17.2. 

Concerning the location modeling aspect, we may conclude by looking infor¬ 
mally and somewhat subjectively at the P values (or half-normal plots) for the 
estimated main effects, that the factors A, D, and possibly G constitute A 2 and 
hence represent the adjustment factors. A separate regression analysis using A, 
D, and G yields the prediction equation 

Y = 2.3375 + 0.85x a - 0.5625x D - 0.4625x 6 

where xa, xd, xg represent the coded levels of A, D, and G, respectively. 
Together with engineering and production considerations this equation can then 
be used to adjust the process so that a particular target, r, can be achieved. 

With regard to the dispersion modeling the results point clearly to factor F 
as having a pronounced effect on the variability and hence constituting the only 
control factor. A look at the data shows, indeed, that the sf for those treatment 
combinations in D\ with F = 0 are much smaller than for those with F = l. 

For a different approach to dispersion modeling and analysis of the same data 
set we refer the reader to Engel (1992). □ 


17.5.2 Dual-Response Modeling 

We have pointed out earlier that cross-array experiments allow for the estimation 
of 2-factor interactions between design and noise factors. Yet, in the preceding 
discussion we have assumed that such interactions either do not exist or have been 
eliminated through an appropriate transformation of the original observations. 
Also, the location and dispersion modeling approach as discussed in the previous 
section is obviously not appropriate for single-array experiments. 

In order to deal with these situations we shall describe now a two-step mod¬ 
eling procedure that Myers, Khuri, and Vining (1992) and Vining and Myers 
(1990) referred to as a dual-response approach. The basic idea is to express an 



642 


ROBUST-DESIGN EXPERIMENTS 


Table 17.2 Robust-Design Experiment Cross Array Data Points, 
Data, and Analysis 

options nodate pageno=l LS=75 ； 
data robust ； 

input ABCDEFGMNOYS@@; 
logS=log(S )； 
datalines ； 

0000000000 2.2 0.0092 0000000011 2.1 0. 

0000000101 2.3 0.0092 0000000110 2.3 0. 

0001111000 0.3 1.7700 0001111011 2.5 1. 

0001111101 2.7 1.7700 0001111110 0.3 1. 

0110011000 0.5 2.1000 0110011011 3.1 2. 

0110011101 0.4 2.1000 0110011110 2.8 2. 

0111100000 2.0 0.0092 0111100011 1.9 0. 

0111100101 1.8 0.0092 0111100110 2.0 0. 

1010101000 3.0 0.0025 1010101011 3.1 0. 

1010101101 3.0 0.0025 1010101110 3.0 0. 

1011010000 2.1 1.8733 1011010011 4.2 1. 

1011010101 1.0 1.8733 1011010110 3.1 1. 

1100110000 4.0 1.7625 1100110011 1.9 1. 

1100110101 4.6 1.7625 1100110110 2.2 1. 

1101001000 2.0 0.0067 1101001011 1.9 0. 

1101001101 1.9 0.0067 1101001110 1.8 0. 


run ； 


proc print data=robust ； 

titlel 'TABLE 17.2'; 

title2 'ROBUST-DESIGN EXPERIMENT'; 

title3 'CROSS ARRAY DATA POINTS AND DATA'; 

run ； 

proc glm data=robust ； 


class A B 

C 

D E F 

G; 

model Y logs 

=A 

B C D E F G; 

estimate 

'A' 

A -1 

l/divisor=16 

estimate 

'B' 

B -1 

l/divisor=16 

estimate 

'C' 

C -1 

l/divisor=16 

estimate 

'D' 

D -1 

l/divisor=16 

estimate 

'E' 

E -1 

l/divisor=16 

estimate 

'F' 

F -1 

l/divisor=16 

estimate 

'G' 

G -1 

l/divisor=16 


title2 'IDENTIFICATION OF ADJUSTMENT AND DISPERSION FACTORS 

title3 'THROUGH ANALYSIS OF VARIANCE'; 

run ； 

proc reg data=robust ； 
model Y = A D G ； 

title2 'ESTIMATING PREDICTION EQUATION 1 ; 
title3 'USING ADJUSTMENT FACTORS A, D, G'; 


0092 

0092 

7700 

7700 

1000 

1000 

0092 

0092 

0025 

0025 

8733 

8733 

7625 

7625 

0067 

0067 
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Table 17.2 (Continued) 


Obs 

A 

B 

c 

D 

E 

F 

G 

M 

N 

0 

Y 

S 

LogS 

1 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

2 

.2 

0.0092 

-4.68855 

2 

0 

0 

0 

0 

0 

0 

0 

0 

1 

1 

2 

.1 

0.0092 

-4.68855 

3 

0 

0 

0 

0 

0 

0 

0 

1 

0 

1 

2 

.3 

0.0092 

-4.68855 

4 

0 

0 

0 

0 

0 

0 

0 

1 

1 

0 

2 

.3 

0.0092 

-4.68855 

5 

0 

0 

0 

1 

1 

1 

1 

0 

0 

0 

0 

.3 

1.7700 

0.57098 

6 

0 

0 

0 

1 

1 

1 

1 

0 

1 

1 

2 

.5 

1.7700 

0.57098 

7 

0 

0 

0 

1 

1 

1 

1 

1 

0 

1 

2 

.7 

1.7700 

0.57098 

8 

0 

0 

0 

1 

1 

1 

1 

1 

1 

0 

0 

.3 

1.7700 

0.57098 

9 

0 

1 

1 

0 

0 

1 

1 

0 

0 

0 

0 

.5 

2.1000 

0.74194 

10 

0 

1 

1 

0 

0 

1 

1 

0 

1 

1 

3 

.1 

2.1000 

0.74194 

11 

0 

1 

1 

0 

0 

1 

1 

1 

0 

1 

0 

.4 

2.1000 

0.74194 

12 

0 

1 

1 

0 

0 

1 

1 

1 

1 

0 

2 

.8 

2.1000 

0.74194 

13 

0 

1 

1 

1 

1 

0 

0 

0 

0 

0 

2 

. 0 

0.0092 

-4.68855 

14 

0 

1 

1 

1 

1 

0 

0 

0 

1 

1 

1 

.9 

0.0092 

-4.68855 

15 

0 

1 

1 

1 

1 

0 

0 

1 

0 

1 

1 

.8 

0.0092 

-4.68855 

16 

0 

1 

1 

1 

1 

0 

0 

1 

1 

0 

2 

. 0 

0.0092 

-4.68855 

17 

1 

0 

1 

0 

1 

0 

1 

0 

0 

0 

3 

. 0 

0.0025 

-5.99146 

18 

1 

0 

1 

0 

1 

0 

1 

0 

1 

1 

3 

.1 

0.0025 

-5.99146 

19 

1 

0 

1 

0 

1 

0 

1 

1 

0 

1 

3 

. 0 

0.0025 

-5.99146 

20 

1 

0 

1 

0 

1 

0 

1 

1 

1 

0 

3 

. 0 

0.0025 

-5.99146 

21 

1 

0 

1 

1 

0 

1 

0 

0 

0 

0 

2 

.1 

1.8733 

0.62770 

22 

1 

0 

1 

1 

0 

1 

0 

0 

1 

1 

4 

.2 

1.8733 

0.62770 

23 

1 

0 

1 

1 

0 

1 

0 

1 

0 

1 

1 

. 0 

1.8733 

0.62770 

24 

1 

0 

1 

1 

0 

1 

0 

1 

1 

0 

3 

.1 

1.8733 

0.62770 

25 

1 

1 

0 

0 

1 

1 

0 

0 

0 

0 

4 

. 0 

1.7625 

0.56673 

26 

1 

1 

0 

0 

1 

1 

0 

0 

1 

1 

1 

.9 

1.7625 

0.56673 

27 

1 

1 

0 

0 

1 

1 

0 

1 

0 

1 

4 

.6 

1.7625 

0.56673 

28 

1 

1 

0 

0 

1 

1 

0 

1 

1 

0 

2 

.2 

1.7625 

0.56673 

29 

1 

1 

0 

1 

0 

0 

1 

0 

0 

0 

2 

. 0 

0.0067 

-5.00565 

30 

1 

1 

0 

1 

0 

0 

1 

0 

1 

1 

1 

.9 

0.0067 

-5.00565 

31 

1 

1 

0 

1 

0 

0 

1 

1 

0 

1 

1 

.9 

0.0067 

-5.00565 

32 

1 

1 

0 

1 

0 

0 

1 

1 

1 

0 

1 

.8 

0.0067 

-5.00565 
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Table 17.2 {Continued) 


IDENTIFICATION OF ADJUSTMENT AND DISPERSION FACTORS 
THROUGH ANALYSIS OF VARIANCE 


The GLM Procedure 


Dependent Variable : Y 




Sum of 




Source 

DF 

Squares 

Mean Square 


F Value 

Model 

7 

11.00000000 

1.57142857 


1.67 

Error 

24 

22.60000000 

0.94166667 



Corrected 

Total 31 

33.60000000 





R-Square 

: Coeff Var 

Root MSE 

y 

Mean 


0.327381 

43.12867 

0.970395 2. 

.250000 

Source 

DF 

Type I SS 

Mean Square 


F Value 

A 

1 

5.78000000 

5.78000000 


6.14 

B 

1 

0.18000000 

0.18000000 


0.19 

C 

1 

0.12500000 

0.12500000 


0.13 

D 

1 

2.53125000 

2.53125000 


2.69 

E 

1 

0.66125000 

0.66125000 


0.70 

F 

1 

0.01125000 

0.01125000 


0.01 

G 

1 

1.71125000 

1.71125000 


1.82 

Source 

DF 

Type III SS 

Mean Square 


F Value 

A 

1 

5.78000000 

5.78000000 


6.14 

B 

1 

0.18000000 

0.18000000 


0.19 

C 

1 

0.12500000 

0.12500000 


0.13 

D 

1 

2.53125000 

2.53125000 


2.69 

E 

1 

0.66125000 

0.66125000 


0.70 

F 

1 

0.01125000 

0.01125000 


0.01 

G 

1 

1.71125000 

1.71125000 


1.82 




Standard 



Parameter 


Estimate 

Error 

t 

Value 

A 


0.05312500 

0.02144291 


2.48 

B 

- 

0.00937500 

0.02144291 


-0.44 

C 


0.00781250 

0.02144291 


0.36 

D 

- 

0.03515625 

0.02144291 


-1.64 

E 


0.01796875 

0.02144291 


0.84 

F 

- 

0.00234375 

0.02144291 


-0.11 

G 

■ 

0.02890625 

0.02144291 


-1.35 


Pr > F 
0.1646 


Pr > F 

0.0207 
0.6659 
0.7188 
0.1141 
0.4103 
0.9139 
0.1902 

Pr > F 

0.0207 
0.6659 
0.7188 
0.1141 
0.4103 
0.9139 
0.1902 


Pr > ItI 

0.0207 
0.6659 
0.7188 
0.1141 
0.4103 
0.9139 
0.1902 
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Table 17.2 (Continued) 

IDENTIFICATION OF ADJUSTMENT AND DISPERSION FACTORS 
THROUGH ANALYSIS OF VARIANCE 

The GLM Procedure 

Dependent Variable : Logs 




Sum of 




Source 

DF 

Squares 

Mean Square 

F Value 

Pr > F 

Model 

7 

266.4310915 

38.0615845 

Infty 

< . 0001 

Error 

24 

0.0000000 

0.0000000 



Corrected 

Total 31 

266.4310915 





R-Square 

Coeff Var 

Root MSE LogS Mean 



1.000000 

0 

0 -2 

.233358 


Source 

DF 

Type I SS 

Mean Square 

F Value 

Pr > F 

A 

1 

1 .5111751 

1.5111751 

Infty 

< . 0001 

B 

1 

0.6003957 

0.6003957 

Infty 

< . 0001 

C 

1 

0.2841756 

0.2841756 

Infty 

<.0001 

D 

1 

0.3835367 

0.3835367 

Infty 

< . 0001 

E 

1 

0.7414489 

0.7414489 

Infty 

< . 0001 

F 

1 

261.7830683 

261.7830683 

Infty 

< . 0001 

G 

1 

1.1272911 

1.1272911 

Infty 

< . 0001 

Source 

DF 

Type III SS 

Mean Square 

F Value 

Pr > F 

A 

1 

1.5111751 

1.5111751 

Infty 

< . 0001 

B 

1 

0.6003957 

0.6003957 

Infty 

< . 0001 

C 

1 

0.2841756 

0.2841756 

Infty 

< . 0001 

D 

1 

0.3835367 

0.3835367 

Infty 

< . 0001 

E 

1 

0.7414489 

0.7414489 

Infty 

<.0001 

F 

1 

261.7830683 

261.7830683 

Infty 

< . 0001 

G 

1 

1.1272911 

1.1272911 

Infty 

< . 0001 




Standard 



Parameter 


Estimate 

Error t 

Value 

Pr > |t| 

A 

- 

0.02716392 

0 

-Infty 

<.0001 

B 


0.01712197 

0 

Infty 

< . 0001 

C 

- 

0.01177954 

0 

-Infty 

< . 0001 

D 


0.01368480 

0 

Infty 

< . 0001 

E 

- 

0.01902723 

0 

-Infty 

< . 0001 

F 


0.35752449 

0 

Infty 

<.0001 

G 

- 

0.02346135 

0 

-Infty 

< . 0001 
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Table 17.2 (Continued) 

ESTIMATING PREDICTION EQUATION 
USING ADJUSTMENT FACTORS A, D, G 

The REG Procedure 





Model 

: MODEL1 






Dependent 

Variable : Y 






Analysis 

of Variance 






Sum of 




Source 


DF 

Squares 

Mean Square 

F Value 

Pr > F 

Model 


3 

10.02250 

3.34083 

3.97 

0.0178 

Error 


28 

23.57750 

0.84205 



Corrected 

Total 

31 

33.60000 





Root 

MSE 

0.91763 R-Square 

0.2983 



Dependent 

Mean 2.25000 Adj R-Sq 

0.2231 



Coef f 

Var 

40.78377 






Parameter Estimates 






Parameter 

Standard 



Variable 

DF 


Estimate 

Error t 

Value 

Pr > |t| 

Intercept 

1 


2.33750 

0.32443 

7.20 

<.0001 

A 

1 


0.85000 

0.32443 

2.62 

0.0140 

D 

1 


-0.56250 

0.32443 

-1.73 

0.0940 

G 

1 


-0.46250 

0.32443 

-1.43 

0.1650 


observation as a function of design and noise factor effects as well as the 2-factor 
interactions between design and noise factors, or 

Y = /x(A) + v(^) + HA, B) + ae (17.7) 

where B denotes the noise factor settings. Contrary to (17.5), in (17.7) a is 
independent of the design factors. Even though the noise factors are considered 
to be random factors during the use of the product, we need to remember that 
for the purpose of the robust-design experiment their levels are selected by the 
experimenter. It is for this reason that we first consider (17.7) conditionally on 
B in order to identify important design and noise factors. Using those factors we 
then consider the predicted value Y and consider var(F) as a second response in 
an effort to reduce the variability. 
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More specifically, let us now write (17.7) as 

Y = Xp + Zy^W& + ae (17.8) 

where X^xp, ^Nxq, and Wmxs are known design model matrices associated 
with the design factor effects the noise factor effects y qx \, and the 

design-noise factor interactions 8 SX \, respectively. It is convenient and useful 
here to think of model (17.8) as a regression or response surface model. Depend¬ 
ing on the experimental design used, p may include linear and quadratic main 
effects and possibly some interaction effects between design factors, and 8 may 
include some or all 2-factor interactions between design and noise factors. 

As indicated earlier, in step 1 we analyze Y conditional on y and 8 using anal¬ 
ysis of variance. Suppose we identify C P ， y 0 C y, and So C S as important 
effects. We write a model for the predicted value then as 

y = ^ Q x 0 + ?qZ 0 +^o u, o (17.9) 

where xo represents the linear and quadratic terms for design factors retained 
in (17.9), zo represents linear terms for noise factors retained in (17.9), and 
wo represents cross-product terms xz associated with design and noise factor 
interaction terms retained in (17.9). For example, (17.9) might be of the form 

J = A) + + ^ll^i + ^2X2 + Y2Z2 + Y3Z3 + ^\2X\Z2 + ^23^2Z3 (17.10) 

in which case % = (H ^ 11 ,^ 2 ), ?o = (V2, 巧 ) ，瓦 =(^12,^23) and x’ 0 = 
(1 ， XI, xf, X2 ), Zq = (Z2, Z3), Wq = (x\Z2, X2Z3), with Xi referring to the ith design 
factor and Zj to the jth noise factor. 

Given 办 0 , y^, So and assuming now that the z and w terms are random 
variables, we model the location as 

E(y\^ 0 ,y 0 M)=fo^o (17.11) 

and the variability as 

var(y|j8 0 , y 0 , 8 0 ) = var^zo) + varC^^o) 

+ 2cov()/ozo, fo^o) + cr 2 (17.12) 

To illustrate (17.11) and (17.12) we consider (17.10) and find 

E(y) = % + ^ 1 x 1 + p n xi + $ 2^2 ( 17 . 13 ) 
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and 


var(y) = (? 2) 2 ^2 + (Ks) 2 ^ + (5i2^i)\ 2 2 
+ (如 3 又 2) 2 <^ 2 3 + ^Y2^l2Xl cr^ 2 
+ 20 ^ 23^2 cr^ 3 + g 2 
=l(?2) 2 + (了 12又1) 2 + ^?2^12X \]cr^ 2 

+ [(巧 ) 2 + (^23 X 2) 2 + 2 y 3<523^2]^ 2 3 

-ha 2 (17.14) 

Assuming that the levels for the noise factors and are chosen such that 
a z 2 2 = cr^ 3 = 1, then an estimator for (17.14) can be written as 

var(y|^ 0 , y 0 ,? 0 ) = [(n ) 2 + (?i 2 xi) 2 + ^y^nxn] 

+ [(K3) 2 + (%^2) 2 + 2y 3 {%x 2 ] + MS(£) (17.15) 

where MS ( 五 ） is obtained from the ANOVA using model (17.8). By choosing 
the levels of the noise factors such that a}. = 1， we also assume that under 
field conditions the z values represent a random sample of levels within the 
same range. 

The response surface (17.11) in general or (17.13) for the illustrated case will 
then be used to adjust the process to the target value. At the same time the 
response surface (17.12) in general or (17.15) for the illustrated case will serve 
to reduce the variance by using the identified adjustment factors. In general, these 
two objectives cannot be achieved independently of each other and hence require 
engineering as well as statistical considerations. For examples, see Myers, Khuri, 
and Vining (1992) and Vining and Myers (1990), who also point out how this 
may involve sequential experimentation. 

Another way of making use of the design-noise factor interactions is to con¬ 
sider, for each interaction, the interaction plot and choose the design factor level 
that has the flattest response over the range of the particular noise factor (see Wu 
and Hamada, 2000). 

Finally, an extension of model (17.8) and the analysis based on it has been 
proposed by Wolfinger and Tobias (1998). Their mixed linear model allows a 
more direct approach of modeling variability in addition to location. However, 
this approach requires the assumption of normality for the distribution of the 
dispersion and random effects in order to perform maximum likelihood or REML 
estimation. 



CHAPTER 18 


Lattice Designs 


18.1 DEFINITION OF QUASI-FACTORIAL DESIGNS 

We have previously discussed the desirability and usefulness of incomplete block 
designs (see Chapter 1) when the number of treatments is large and/or complete 
blocks are unavailable or inappropriate. In this chapter we describe yet other 
methods of constructing certain types of resolvable incomplete block designs, 
some of which are in fact BIB or PBIB designs, but others are not. These meth¬ 
ods are based on a correspondence between the treatments to be compared and 
the treatment combinations of a factorial set such that the procedures for con¬ 
structing systems of confounding (see Chapter 11) can be utilized. Because of 
this correspondence, these designs are referred to as quasi-factorial or lattice 
designs. We emphasize, however, that the actual treatments do not have a factorial 
structure. 

Historically, lattice designs were developed for large-scale agricultural exper¬ 
iments (Yates, 1936b) in which large numbers of varieties were to be compared. 
The main application since then has been and continues to be in agriculture. 

A special feature of lattice designs is that the number of treatments, t, is related 
to the block size, k, in the form t = k 2 or t = k 3 or t = k(k + 1) (see Section 
18.2). Even though this limits the number of possible designs (see Section 6.2), 
lattice designs represent an important class of designs nevertheless, in particu¬ 
lar when one is dealing with a large number of treatments. In certain types of 
agronomic experiments the number of treatments can easily be 100 or more, for 
example, in breeding experiments. 


18.1.1 An Example: The Design 

We shall illustrate the basic idea of a lattice design with a simple example. 
Suppose we have t = 4 treatments t\, t 2 , t\ and blocks of size 2. We set 
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up the following correspondence between the four treatments and the treatment 
combinations of a 2 2 factorial (using the notation of Section 7.2.4): 

Treatment t\ t 2 ，3 

Quasi-factorial combination (1) a b ab 

To arrange the treatments in blocks of size 2 we make use of a system of 
partial confounding (see Chapter 9). We may, for example, use three different 
types of confounding in three replicates, that is, 

Replicate Confounding Blocks 

I A (1), b and a, ab 

II B (1), a and b, ab 

III AB (1), ab and a, b 

In terms of the treatments t\, t 2 , ? 3 , t\ the arrangement is then as follows: 

Replicate I Replicate II Replicate III 




t\, t2 



t2, U 


t4 


t2, 


Suppose that this basic pattern is repeated q times, so that there are 3q repli¬ 
cates altogether. 

18.1.2 Analysis 

Using the results of Chapter 9 we obtain the estimates Anjn, Bun, ABm ，In 
order to make comparisons among the treatments t\, t2, t\ or their effects 
ri, T 2 , T 3 , T 4 , respectively, we apply the parameterization (7.49). In this way we 
obtain, for example, 

^1—^2= —Mini + A Bin 
?i — t3 = — 召 i，iii + ABiu 
fi - r4 = -An,m + 5i，ni 

and other contrasts can be estimated similarly. Since An ， m, ^i ， m，and ABi m 
are independent with 

var(A IUII ) = var(5 un ) = var(A5 U i) = a;/2q 
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Table 18.1 T | B-ANOVA for Lattice Design 


Source 

d.f. 

SS 

x P \o 

3q - 1 

Usual 

X^\3,X P 


Usual 

X Z \3,X P ,X^ 

3 

2 ?{[A n ,in] 2 + [ 妄 i,m] 2 + [A5i,ii] 2 } 

I\0,X p ,X^,X T 

3(2^ - 1) 

Subtraction 

Total 

\ 2 q - 1 

Usual 


it follows then that 


var(r；- - r} ) = (y]j 2 q 


for /, j = 1,2, 3,4, / ♦ 

To estimate we turn to the T|B-ANOVA given in Table 18.1, which follows 
from the familiar model [see (9.1)] 


y = fiO + X pp + X P* + X t t + e 

Hence MS(/|3, X p , X^*, X T ) is an estimate of 

Based upon the factorial correspondence it is also easy to obtain the combined 
estimator for treatment comparisons. Using the notation of Section 9.2.3, we 
combine the intrablock estimator An,m and the interblock estimator A\ and obtain 
the combined estimator for A as 


2 _ 2qwAiiiu-\- qw^i 

2 qw qw r 

wA\\ + wAm + w f A\ 
2 w + w f 


(18.1) 


where we have used the fact that An,m = \ (An + Am) and w = l/〜 2 , w/ = 
l/[cr^ + 2cx^]. Similarly 



B = 

wB\ + wB\u + w^u 

(18.2) 


2 w + w , 

and 

Tb = 

wAB\ + wABu + w f A Em 

(18.3) 


2 w + w r 

Since 

var (A)= 

var ( = var f = - i - 

V / V / q(2w + w f ) 




652 


LATTICE DESIGNS 


Table 18.2 B | T-ANOVA for Lattice Design 

Source d.f. SS 


X P \J 

3《- 

x T \o,x p 

3 

x^\o,x p ,x T 

3^ 

A x reps I 

q - i 

B x reps II 

q - i 

AB x reps III 

q - i 

An,in vs. Ai 

l 

Bijn vs. B\\ 

l 

ABm vs. A5 hi 

l 

i\o,x^*x T 

\2q 


Usual 

Usual 



From Table 18.1 


五 (MS) 

a e + 
a e + 
a e + 

°^\ a l 

+ 

a l + ^ a l 


it follows then that, for example, 

var (z\ — 〒 2) = var (-A + = --- 

V 7 V ) q{2w^w f ) 

The variance of other contrasts can be worked out similarly. To estimate w and 
w/ use is made of the T | B-ANOVA of Table 18.1 and the B | T-ANOVA of 
Table 18.2, which follows the format of Table 9.11. We thus obtain 

苟 = ^ = 々， : f r ) (18.4) 

o] + 2aj = 去 = 

-—^—MS(I\0,X p ,X r ,X T ) (18.5) 

3q — l 

Substituting (18.4) and (18.5) into (18.1) ， (18.2), and (18.3) then yields the 
combined estimator for A, B, and AB, respectively, and with that the combined 
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estimators for r, — Tj, for example, 

2wAiim + w f A\ 2wABin + w , A Bm 


t：i _ T2 =- 

with estimated variance 


2w + w r 


2w + w f 


var 


(ri-rj) 


2qw + qw f 


(18.6) 


for all i, j = 1,2, 3,4, / ^ j. 

The fact that each treatment comparison is estimated with the same variance, 
whether we use intrablock information only or combined intra- and interblock 
information, suggests that this lattice design is in fact a BIB design. This can 
be verified easily, and this design serves as an illustration of the construction 
procedure referred to in Section 3.4. Because of the feature mentioned above, 
this particular lattice design is called a balanced lattice. It is just one of several 
types of lattice designs as we shall describe in the next section. 


18.1.3 General Definition 

Even though we have only discussed a simple example, the general idea of 
a lattice design can be stated as follows: The treatments are identified with the 
treatment combinations of a symmetrical factorial. The implied factorial structure 
is then used to construct a resolvable design in incomplete blocks by using 
different systems of partial confounding of main effects and interactions. The 
same factorial (or better: quasi-factorial) structure can then be used to obtain 
various types of estimators, that is, intrablock, interblock, combined estimators, 
for contrasts of treatment effects, together with their variances. This latter point is 
no longer as important as it was at the time when these designs were developed. 
We can now simply refer to the procedures described in Sections 1.8 and 1.14. On 
the other hand, the derivations in terms of the factorial are useful for establishing 
general properties of lattice designs, as, for example, whether they are BIB or 
PBIB designs. 


18.2 TYPES OF LATTICE DESIGNS 

The existing lattice designs can be classified according to 

1. Number of treatments, t 

2. Block size, k 

3. Number of different systems of confounding used, s 

4. Number of restrictions imposed on randomization, c 
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Using these parameters a partial list consisting of the most commonly used lattice 
designs is given in Table 18.3. Although this table mentions only lattice designs 
with c = 1, 2, and t = K 2 , K\ K(K — 1), it should be clear how this classifi¬ 
cation can be extended to «-dimensional lattices, that is, ^ = K n (n = 4, 5,. • • 
with c restrictions, c = 1, 2, 3,... (e.g., Kempthorne and Federer, 1948a, 1948b; 
Federer, 1955). 

Balanced lattices have the same properties as BIB designs, in that each treat¬ 
ment occurs together once with every other treatment in the same block, which 


Table 18.3 Types of Lattice Designs 


Type 

c 

t 

k 

s 

One-restrictional two-dimensional 





(square) lattices 

1 

K 2 



Simple (double) lattice 



K 

2 

Triple lattice 



K 

3 

Quadruple lattice 



K 

4 




6) 


Balanced lattice 



K a 

K + \ 

One-restrictional three-dimensional 





(cubic) lattices 

1 

K 3 



Triple lattice 



K 

3 

Quadruple lattice 



K a 

4 

Balanced lattice 



K a 

K 2 -\-K + 1 

Two-restrictional two-dimensional 





lattices 

2 

K 2 



Semibalanced lattice square 



K a 

(K + 1)/2 

Balanced lattice square 



K a 

K + \ 

Unbalanced lattice square 



K a 

>2 

Unbalanced lattice square 



6 

<3 

Unbalanced lattice square 



10 

<4 

Unbalanced lattice square 



12 

<7 

Unbalanced lattice square 



14 

<5 

Unbalanced lattice square 



15 

<6 

One-restrictional two-dimensional 





rectangular lattices 

1 

K(K - 1) 



Simple rectangular lattice 



K - 1 

2 

Triple rectangular lattice 



K - 1 

3 

Near balanced 





rectangular lattice 



K-\ a 

K 


K is prime or prime power. 
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implies that all treatment differences are estimated with the same variance. The 
number of different systems of confounding ( 夕 ) needed to achieve such balance 
depends on the dimensionality (n) of the design, increasing roughly exponentially. 
Even for moderate values of n, it may therefore be difficult, from a practical point 
of view, to achieve balance. 

Lattice designs other than balanced lattices have at least the properties of PBIB 
designs with two or more associate classes (see Section 18.7). The method of 
constructing lattice designs using different systems of confounding provides thus 
an easy and convenient way to construct PBIB designs of certain types. 

With regard to rectangular lattices one can envision more general designs 
with t = KL in blocks of size k = L with L < K — l. The most important 
case, however, and the one usually referred to as rectangular lattice is that of 
L = K — Apart from using blocks of size k = K — l instead of K as for the 
other (square) lattice designs, the rectangular lattice of the above type is suitable 
for t values between (K — l) 2 and K 2 , thus providing additional incomplete 
block designs. Even though the rectangular lattices can be constructed easily 
(see Section 18.11), they are not based on systems of confounding, that is, they 
are not based on the method of pseudofactors. It has been shown (Nair, 1951) that 
rectangular lattices are PBIB designs or even more general than PBIB designs. 


18.3 CONSTRUCTION OF ONE-RESTRICTIONAL 
LATTICE DESIGNS 

18.3.1 Two-Dimensional Lattices 

We consider first the case where K is prime or prime power. The K 2 treatments 
can then be represented by the treatment combinations of a K 2 factorial with 
pseudofactors A and B. A simple lattice with blocks of size K is obtained by 
confounding A in one replicate (sometimes referred to as the X arrangement) and 
B in the other replicate (sometimes referred to as the Y arrangement). For a triple 
lattice we may confound A, B, and AB, respectively, in the three replicates, and 
for a quadruple lattice A, B, AB, and AB 2 , respectively. Continuing in this fash¬ 
ion, a balanced lattice is obtained by confounding A, B, AB, AB 2 ,..., AB K ~ l , 
respectively, in the + 1 replicates. 

This method of using systems of confounding to construct lattice designs is, 
of course, equivalent to arranging the K 2 treatments in a square array with the 
cell in the zth row and jth column containing the treatment combination atbj and 
using for a simple lattice the rows as blocks for one replicate and the columns 
as blocks for the other replicate. For a triple lattice the rows, columns, and Latin 
letters of a Latin square constitute the three replicates. Since a Latin square exists 
for any K, this implies that we can construct a triple lattice for every K • For 
a quadruple lattice the blocks of the four replicates are generated by the rows, 
columns, Latin letters, and Greek letters, respectively, of a Graeco-Latin square 
of order K. This is possible for all K except K = 6. The types of lattice designs 
that can be constructed for a given K depends then on how many mutually 
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orthogonal Latin squares (MOLS) exist for that K. For example, for K = 10, 
there exist only 2 MOLS, hence only simple, triple and quadruple lattices can 
be constructed; for K = \2 there exist 5 MOLS so that lattices up to a septuple 
lattice can be constructed, similarly, for K = 14, 15 there exist 3, 4 MOLS, 
respectively, so that lattices up to a quintuple, sextuple lattice, respectively, can 
be constructed (see Abel et al., 1996). Hence, K — l MOLS and balanced lattices 
exist, however, only for K prime or prime power. 

18.3.2 Three-Dimensional Lattices 

The replicates of these designs consist of K 2 blocks of size K. Expressing the K 3 
treatments as the treatment combinations of a K 3 factorial with factors A, B, C 
such replicates can be obtained for K a prime or prime power by confounding any 
two effects or interactions and their GIs. In order to avoid complete confounding 
of any effect or interaction, which would lead to a disconnected design, at least 
three replicates are needed giving a triple lattice. We may specify a set of three 
replicates as follows: 


Rep I: Confound A, B, AB, AB 2 , ... ， AB K_1 
Rep II: Confound A, C, AC, AC 2 , … ， AC K ~ 1 
Rep III: Confound B, C, BC, BC 2 ,..., BC K ~ l 

which are sometimes referred to as X, F, Z arrangements, respectively. 

Alternatively, we can arrange the treatments in sl K x K x K cube such that 
the treatment in the ith row, jth column, and ith layer is aibjci ，The blocks for 
the three replicates can then be specified as follows: 


Rep I: {aibjCi, £ = 1,2, ..., K] for each i, j = 1,2, …， K 

Rep II: {aibjCi, j = 1,2,..., K] for each i ， £ = 1,2,…， K 

Rep III: {aibjCi, i = 1,2,..., K] for each = 1,2,K 

Obviously, such an arrangement is possible for every K, so that for a triple lattice 
K does not have to be prime or prime power. 

For a quadruple lattice with K prime or prime power we can specify in addition 
to the three replicates above 


Rep IV: Confound AB, AC and all GIs 


This process of adding further independent systems of confounding can be 
continued to generate further lattices. It follows from (11.15) that the total number 
of possible systems of confounding is 


(K 3 — l)(K 3 - K) K 3 - 1 
(K 2 - 1)(K 2 — K) = K - 1 


=K 2 + K +1 


Using all such possible systems of confounding yields a balanced lattice. 
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18.3.3 Higher-Dimensional Lattices 

The methods described in the previous sections can be extended immediately to 
higher dimensions such as t = K 4 or t = K 5 . For t = K 4 and blocks of size 
K, we need at least four replicates to obtain a connected design. This may be 
achieved as follows: 

Rep I: Confound A, B, C, and GIs 

Rep II: Confound A, B, D, and GIs 

Rep III: Confound A, C, D, and GIs 

Rep IV: Confound B, C, D, and GIs 

which is equivalent to writing the blocks as 


Rep I: 

{aibjCid m ,m = 


..,K} for /, j, i = 

1,2,.. 

.,K 

Rep II: 

{ciibjC£d m , £ — 

1,2,.. 

., K) for i, j, m = 

1,2,.. 

-,K 

Rep III: 

{dibjCid m , j = 

1,2,.. 

., K) for /, = 

1,2,.. 

.,K 

Rep IV: 

{ciibjc^d m , i = 

1,2,.. 

• ， K] for j, = 

1,2,.. 

.,K 


This shows that a lattice design is possible for every K. 

For a balanced lattice with t = K 4 treatments we need (K 4 — l)/(K — 1) 
different replicates. Already for [ = 3 this number, 40, is prohibitively large for 
most practical purposes. An enumeration of such a set of systems of confounding 
is given by Kempthome (1952). 

For t = K 5 the same arguments can be made, except that in this case at least 
five replicates are needed to achieve a connected design. 

Rather than considering only blocks of size K we may also envisage blocks of 
size K 2 or K 3 . Using the factorial correspondence, it should be clear how such 
designs can be constructed for K prime or prime power. Care must be taken to 
avoid complete confounding of any main effect or interaction. For other values 
of K the arrangement of the treatments in a hypercube suggests immediately how 
replicates with blocks of size K 2 or K 3 can be found by simply extending the 
principle used above. 


18.4 GENERAL METHOD OF ANALYSIS FOR 
ONE-RESTRICTIONAL LATTICE DESIGNS 

Since lattice designs are incomplete block designs, their analysis follows along 
the general principles discussed in Chapter 1, leading to intrablock or combined 
intra- and interblock estimates of estimable functions of the treatment effects. 
However, since the construction of lattice designs for K prime or prime power 
is based on the methods of factorial experiments, the analysis can be carried out 
alternatively as that of confounded factorial experiments as presented in Chapter 
11 making use of the parameterization of treatment effects in terms of main 
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effects and interactions (cf. Section 11.9). We shall now reiterate the major steps 
for this type of analysis, making extensive references to Chapter 11. 

The basic model for any one-restrictional lattice design is 


yi ji = M + P/ + Ptj + r € + e i ji (18.7) 

where p/ is a replicate effect (/ = 1 , 2 ,..., sq, q = number of repetitions of basic 
lattice design), is a block/replicate effect (j = 1,2,..., b with b depending on 
the type of lattice), and xn is a treatment effect (Ji = 1,2,..., t). Further, if treat¬ 
ment i corresponds to the pseudotreatment combination x r = (xi, X 2 ,..., xk), 
we have, using the notation of Section 11.9), 


t a = r(x) = a(x) - M = ^ E^ x (18.8) 

aeJl 

We are interested in estimating treatment contrasts 

t 

c e T e = ^ = XI XI (18.9) 

€=1 Arex ae Jl 

for 


E E c(x) = 0 

t=\ xex 


using the right-hand side (RHS) of (18.9), that is, using estimates of the E^, x . 
From the construction of lattice designs it follows that the interaction components 
in (18.9) fall into only two of the three sets defined in Section 11.6, namely 
E y ^ e £2 (partially confounded) and E 8m e 8^ (unconfounded). Thus, if we use 
intrablock information only, we obtain and in the usual way and then 


ctrt = ^2 *) 卞 OO = X! *) 
i=\ xex xex 


PH 


(18.10) 


If we use intra- and interblock information, as is usually the case with lattice 
designs, we obtain 


^2 ciU = ^ c(x)f(x) = ^ c(x) 


xex 


xex 


£2 83 


where E y f is given in (11.87). 

y i x 


(18.11) 
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A specific form of (18.9) is ri — = r(x) — r(z), say. Then (18.11) is 

given by 

— 如 =m %]+d 定 y ( i8 . 12 ) 

£ 2 s 3 


Recall that S 2 contains n 2 interactions and S 3 contains interactions with 
^ 2+^3 = (K n — l)/(K — 1). There are, however, (K n ~ l — l)/(K — 1) differ¬ 
ent E y ^ g £2 and E Sfn e £3 for which y^x = y、z and S f m x = 8 f m z, for given x 
and z, so that the right-hand side of (18.12) contains only K n ~ l terms. Denoting 
the sets of interactions for which y^x ^ y^z and 8 f m x ★ & f m z by z) and 
z), respectively, and using (11.98) we then have for the estimator (18.12) 


var 


[?(x) -f(z)] 


2aj 


q(x,z) 


E [w 

£^(x,z) 


(18.13) 


Except for balanced lattices it will be the case that (18.13) will take on different 
values so that different treatment comparisons may be estimated with different 
variances. Rather than use the variance as given by (18.13) for a given comparison 
it is common practice in the analysis of lattice designs, in particular within the 
context of multiple comparison procedures, to use the average variance of all such 
comparisons. It follows from (11.100) that such average variance is given by 

av. var = 2 Rn _ r a e [w(a) + c(a)p -1 ] -1 
aeA, 


where A is the set of all partitions for a K n factorial and 0 < u{a) < .s, 0 < 
c{ct) < ^ — 1 with u(u) + c(a) = 5 as determined by the type of lattice. Alter¬ 
natively, (18.14) can be written as 


av. var = 2 - 


K - 1 


▽ 


K n 


^ i + (s - i)p~ x 


(18.14) 


where m/ denotes the number of E a unconfounded in i replicates, that is, for 
which u{ol) = i, and hence confounded in (s — 1 ) replicates, that is, for which 
c(ot) = s — i. 

We illustrate this with the following example. 


Example 18.1 Triple three-dimensional lattice with K = 3, that is, t = 21 
treatments in blocks of size k = 3 with three replicates specifying three different 
systems of confounding. Let those systems of confounding be (see Section 18.3.2) 
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Table 18.4 Plan for Triple Three-Dimensional Lattice with 
27 Treatments 


Block 

Rep I 

Rep II 

Rep III 

1 

000 , 001, 002 

000 , 010, 020 

000 , 100, 200 

2 

100 , 101, 102 

100 , 110, 120 

010 , 110, 210 

3 

200 , 201, 202 

200 , 210, 220 

020 , 120, 220 

4 

110 , 111, 112 

001, Oil, 021 

001 , 101, 201 

5 

220 , 221, 222 

002 , 012, 022 

002 , 102, 202 

6 

010, Oil, 012 

101 , 111, 121 

011 , 111, 211 

7 

020 , 021, 022 

102 , 112, 122 

012 , 112, 212 

8 

120 , 121, 122 

201 , 211, 221 

021 , 121, 221 

9 

210 , 211, 212 

202 , 212, 222 

022 , 122, 222 


Rep I: A, B, AB, AB 2 

Rep II: A, C, AC, AC 2 
Rep III: B, C, BC, BC 2 


The experimental plan is then as given in Table 18.4. It is easy to check 
that there are three types of comparisons between treatments; that is, (18.13) 
leads to three distinct expressions for all possible comparisons. Denoting these 
three expressions by V/ (i = 0 , 1 , 2 ), where Vi is the variance of a treatment 
comparison when both treatments have i levels of the three pseudofactors in 
common, we find from (18.13) 


Vo 


3 


1 + 2 + p- 





2 


5 


l + 2p- 


2 + p 


-l 


+ 



V 2 ： 


4 


1 + 2 /r 


2 + p 


-l 


and from (18.14) 


2 

av. var = — 
13 


3 6 4 

1 + 2p~ l +2 + P - 1 + 3 



□ 


It is a property of all lattice designs discussed up to this point that (18.13) 
will result in only a few distinct expressions for all possible comparisons, similar 
to the findings in Example 18.4. This means that comparisons can be put into 
distinct sets such that all treatment comparisons in a given set are made with the 
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same precision and that the precisions are different for the different sets. This 
is, of course, also the property of PBIB designs. We shall return to this topic in 
Section 18.7. 

To estimate var^x) — T(z)] as given by (18.13), we need to estimate and 
p, or alternatively w and w r . As with factorial experiments this is achieved via 
the T I B- and B I T-ANOVA tables. The results of Sections 11.10 and 11.12, in 
particular Tables 11.1a and 11.2 are applicable to lattice designs putting p = 
K, n\ = 0, and ^2 + ^3 = (K n — l)/(K — 1). In most cases we have only one 
repetition of the basic pattern; that is, ^ = 1. We note that in the terminology 
of lattice designs the sources Remainder and vs. E n Y' of (X^* \ 3, X p , 

X T ) in Table 11.2b are often referred to as component (a) and component (b), 
respectively. 

From the discussion in Section 11.12 it should then be clear how w and w/ 
are estimated for lattice designs. As an illustration we elaborate Example 18.1. 

Example 18.1 (Continued) Using the model 

yiji = M + A- + Q + eiji 

(i = 1,2,3; 7 = 1,2, £ = 1,2,27), the TI B-ANOVA is given in 

Table 18.5. Here SS(Treatments) =SS(X T \ 0, X p , Xp*), the component SSs of 
which are computed from those replicates in which the respective effects or 
interactions are unconfounded, for example, SS(A x ^) 11 ,m is the usual sum 
of squares for the A x 5 interaction obtained from replicates II and III. The 
B I T-ANOVA is presented in Table 18.6. The partitioning of SS (Blocks/reps)= 
SS(Z^* I 3, X p , X r ) follows the general procedure outlined in Section 11.12 
with regard to computing sums of squares and determining the E(MS). We then 
obtain = l/w = MS (Residual) and with A = |, 

+ = ▲ = ! MS (Blocks/reps )— 臺 MS (Residual) 

using (11.96). □ 


18.5 EFFECTS OF INACCURACIES IN THE WEIGHTS 


We recall that an estimate of r(x) — r(z) is obtained by replacing in (18.12) w 
and w f by w and w\ respectively. We thus obtain 



£2 


£3 
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Table 18.5 T| B-ANOVA for Triple 
Three-Dimensional Lattice with 27 Treatments 


Source 

d.f. 

SS 

Replicates 

2 

^Y.yl-hy 2 - 

i 

Blocks/Reps 

24 

全 K - ^ 2Z yl 

ij i 

Treatments 

26 

sum of SS’s below 

A 

2 

SS(A)m 

B 

2 

SS(5)n 

Ax B 

4 

SS(A x 5)n，m 

C 

2 

SS(C)i 

AxC 

4 

SS(A x C^ijii 

B xC 

4 

SS(B x C)i，n 

A x B x C 

8 

SS(A x B x C)i ， ii，iii 

Residual 

28 

Difference = SS (Residual) 

Total 

80 

J2 yfn - y 2 - 


say, where 

~ a _~ a = “(《) 汸 [K' X - K' Z ] + c ( a ) 汸 ’ [K'x - K' Z ] 

(a；x) ia，z) ~ u(a)w + c(oc)w f 

using the notation of Section 11.12. It can be shown that 

- 义 a -] 

E ^oc'x ~ ^a'z ~ E: x - E^， z 

and it follows then that 


E [r(x) - r(z)] = r(x) - r(z) 
Turning now to var[r(x) — r (z)] we note first that 


var 


[K x - K z ] 


K n ~ l u(a)w 


(18.16) 


(18.17) 
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Replicates 

Treatments 

Blocks/Reps 
Component a 


Component b 


27 


Residual 


26 

24 

6 

2 

2 

2 

18 

2 

2 

2 

2 

2 

2 

2 

2 

2 

28 


EW. 

i 

J2 y 2 i 


81 


81 


又 : 


SS ⑷ + SS(b) 

SS(a) = sum of SSs below 
SS(A)i + SS(A) n - SS(A) U i 
SS(B)i + SS(5) m -SS(B)i,m 
SS(C)„ + SS(C)m-SS(C)i U ii 
SS(b) = sum of SSs below 
SS(A vs. A) 

SS(B vs. B) 

SS(AB vs. AB) 

SS(AB 2 vs. AS 2 ) 

SS(C vs. C) 

SS(AC vs. AC) 

SS(AC 2 vs. AC 2 ) 

SS(BC vs. BC) 

SS(fiC 2 vs. BC 2 ) 

From Table 18.5 


and 


var 


[H\] 


2 


1 


■ zJ K n_1 c(a)w / 
for a'x a’z. Using (18.17) and (18.18) in (18.16) we obtain 

K n ~ l ^ ^ 

[u(oc)w + c(a)w f ] 


(18.18) 


2 —. [ 驾，厂乾 , Z IA 以 


■ var 


2 


(18.19) 


Note that if w = w and w f = w\ then (18.19) reduces to l/[u(cc)w + c(a)w/]. 
If we put 


F 


U) 

w r 


U) 


w 


Table 18.6 B | T-ANOVA for Triple Three-Dimensional Lattice with 27 Treatments 

Source d.f. SS £(MS) 


2 rtp 2 rtp 2 rtp 2 rtp 2 2CQ. 2co_ 2 ftp 2 rtp 2 2 rtp 

b b b b b02CQ.2CQ.a'b 2co.cr(7cra' 

2 3 3 3 3 5-3 a a 2 2 _u 2 2 2 2 

+ + ++++++ + + + + + + + 

262^262626 26262^2^ 2^2626 2^2^ 2^26 
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then, following Kempthome (1952), the right-hand side of (18.19) can be writ¬ 
ten as 


or 


1 

u(a)w + c(a)w r 


u(a)c(a) w' 2 (F - l ) 2 
[u(a)w + c(a)w f ] 2 [u{a)w + c{cc)w f ~\ 


w 


u(a)c(a)p~ l (F — l ) 2 


u(a.) + c(a) p u{a)F + c(a)p _1 ] \u(ol) + c(a)p _1 ] J 


(18.20) 


where w/w f = p. The proportional increase in the variance of 、— E °^ z 、 
due to inaccuracies in the weights, that is, due to the fact that we substitute w and 
w f for w and is then given by the second term in (18.20). We note that this 
increase depends on the weights only through w f /w = p 一 1 and w/w f . We note 
that if w f /w = w r /w then F = 1 and hence there is no increase in variance. Also, 
ifw<w , we take, following the Yates procedure, w/w , = 1. Then F = p~ l and 
hence the proportional increase in (18.20) becomes 


1 u(ot)c(a)(p~ l — l ) 2 
w p~ l s \u(cc) + c(a)p~ x 


(18.21) 


where we have used the fact that u{a) + c(a) = s. Adding over all possible 
effects and interactions, we shall obtain the increase in variance for a treatment 
comparison due to inaccuracies in the weights, which is a constant if F < p _1 
and is a function of F if F > p~ l . The percentage loss in information due to 
the inaccuracies in the weights may then be obtained. We shall illustrate this by 
continuing Example 18.1. 


Example 18.1 (Continued) Using the results of Section 18.4 and (18.20) and 
(18.21), we find the following increases in 


Vo ： 


Vi ： 


V 2 ： 


and 


2 1 
9 w 

2 1 
9 w 

2 1 
9 w 


av. var: 


2 

13 w 


3.1.2 

1 

3.2.1 

( J F + 2p- 1 ) 2 (l+2p- 1 ) 

(2F + p-i)2(2 + p-i) 

2.1.2 

1 

5.2.1 

( J F + 2p- 1 ) 2 (l+2p- 1 ) 

(2F + p- 1 ) 2 (2 + p- 1 ) 

1.1.2 

1 

4.2.1 

(F + 2p-!) 2 (l + 2p-l) 

(2F + p-i)2(2 + p-i) 

1 r 3i - 2 

6.2.1 

丄 


■p-\F-\) 2 

■p-'iF-l) 2 

■p~\F-l) 2 


(F + 2p- 1 ) 2 (l+ 2p-i) (2F + p- 1 ) 2 (2 + p- 1 ). 
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for F > p _l , and 


2 1 


2 1 


3.1.2 


3-2.11 (p- 1 - l) 2 


1 +2p-i 

+ 2 + P-1 」 

3p - 1 

2.1.2 

5.2.1 1 

1 

(〆-l) 2 

1 +2p-i 

2 + p-iJ 

3p - 1 

1-1-2 

4.2. 1 1 

1 

(〆-l) 2 

1 +2p-i 

2 + p-iJ 

3p - 1 

" 3.1.2 

6.2.1 ■ 
■ + _ ^ 

l (p _1 - 1) 2 


13 w [1+2P- 1 2 + p- 1 


If we assume a Gauss-Markov normal linear model (GMNLM) and if we use 
w = l /MS (Residual) 


w r = 1 /MS(Remainder) = l/MS(Component a) (18.22) 

[see (11.90)]，then 

w’ w 
F = __ . — 

w f U) 

follows an F distribution with v and vr d.f. (see Table 11.2). This would allow us 
to determine, for a given p, the expected increase in variance due to inaccuracies 
in the weights. If instead of (18.22) we use 

w f = l/MS(X p *\3,X p X r ) 

matters become more complicated because of the heterogeneity of MS (X^* | 3, 
X p , X T ), that is, because the different components of MS (X^* \ 0, X p , 
X r ) have different expected values (see Table 18.6). For details we refer to 
Kempthome (1952). 

The results published by Yates (1939) and Kempthome (1952) indicate that 
the effect of inaccuracies of the weights on the variance of treatment comparisons 
is of the order of 1, 2, or 3% and, therefore, trivial. The effect of inaccuracies 
of the weights will increase as the number of degrees of freedom for interblock 
error [v or s(K n ~ l — 1)] and intrablock error (v/^) decrease. Usually the number 
of degrees of freedom for intrablock error is sufficiently large, and as a practical 
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rule with lattice designs, it is probably safe to say that the effect of inaccuracies 
will be inappreciable if the interblock error is based on 10 or more degrees of 
freedom. With a smaller number of degrees of freedom, it is best to assume that 
u/ = Q and to utilize intrablock information only. 

18.6 ANALYSIS OF LATTICE DESIGNS AS RANDOMIZED 
COMPLETE BLOCK DESIGNS 

Yates (1939) has shown that certain types of lattice designs may be analyzed as 
randomized complete block designs (RCBD). His method of proof depends on 
the fact that, if, under a particular analysis, the expectation of the treatment mean 
square, in the absence of treatment effects, is equal to the expectation of the error 
mean square, this analysis gives an unbiased estimate of the error variance for 
treatment comparisons. This, however, is not necessarily true, for the error sum 
of squares must be homogeneous. In this section we shall discuss briefly to what 
extent such an analysis is valid. 

Suppose we have an experiment on K n treatments in a lattice design with 
blocks of size K and s replicates. For the analysis as an RCBD we use the 
model 

y = /xO X p p + X T r + co (18.23) 

with 

(o = X^^ +e (18.24) 

and the analysis of variance as outlined in Table 18.7 (see Section I.9.2.4). 
Using the quasi-factorial approach as employed for lattice designs, we can par¬ 
tition SS(X T I 3, X p ) into (K n — l)/(K — 1) components E a (ot e A) each with 
K — \ d.f. and similarly SS(/ | 0, X p , X r ) into (K n — l)/(K — 1) components 
E 01 x reps (a G A) each with (s — l)(K — 1) d.f. We shall now consider these 
components and their ex£ected mean squares. 

Consider E a and let denote the average response of all treatment combi¬ 
nations in replicate j satisfying the equation = ^ expressed as deviation 

from the average response of replicate j. Note that if E a is not confounded in 


Table 18.7 ANOYA Structure 
for Lattice Design as RCBD 


Source 


d.f. 

X p \3,X x 

s —: 

1 

X t \3,X p 

K n - 

- 1 

I\3,X p ,X t 

(s - 

1 )(/T - 1) 

Total 

sK n 

-1 
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replicate j, then (/ = 0, 1, … ， [― 1). We know (see Section 11.10.3) 

that, under model (18.23), the sum of squares for E a is 


SS(E a )= 



E? 


(18.25) 


If E a is unconfounded in u(a) replicates and confounded in c(a) replicates, 
with u(a) + c(a) = s, then the expected value of [ ^] 2 for each i = 
0, l,..., K — l, using (18.23) with r = 0 and (18.24) is 


c ⑻ 


K n ~ l 


K 


K 


n—2 


~K^ 


K 


n—\ 


K 


n—2 \ 


欠 2 N 


K n ~ l 


K 


K n 


-l 


+ 巧(广 


K 


n—2) 




C(CC)(K - 1) 2 , 5(^-1) 2 


K 


八 _ ” 2 


K n 


Hence 


E[SS(E a )] = (K 一 1) 


c(a) 


Kaj 


(18.26) 


We now consider E a x reps and its associated sum of squares, SS(£ a x reps) 
say, which is a component of SS(7 | 3, X p , X T ), accounting for (s — l) (K — 1) 
d.f. This sum of squares, in turn, can be partitioned into three components as 
presented in Table 18.8. It follows then that 


E[MS(E a x reps)] 


+ 


u(a) 


+ c(a) - 1 


Kcyj 


C -^Kal 


(18.27) 


Table 18.8 Partitioning of SS (/ \0, X p9 X T ) 


Source 

d.f. 


E(MS) 

E a vs. E a 

K- 1 


^ + UiU) K^ 

E a x reps 

[u(a)-l](K- 

1 ) 


E a x reps 

[c ⑻ - 1] ( 尺 - 

1 ) 

+ 
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It follows further from (18.26) and (18.27) that, with r = 0, MS(E a ) and 
MS(E a x reps) and hence 


and 


MS(/|3 ， X p ’X r ) = E 

CL 


SS(E a x reps) 
(s - 1)(^ - 1) 


have the same expected value. 

An ordinary RCBD analysis of a one-restrictional lattice design is, therefore, 
valid insofar as the expectation of the treatment mean squares is the same as the 
expectation of the RCBD error mean square. It follows then, from consideration 
of the randomization test, that the RCBD analysis gives a reasonably valid test 
of the null hypothesis that there are no differences among treatment effects. A 
test may also be made, of course, with the intrablock treatment and error mean 
squares. Difficulties do arise, however, as soon as we wish to ascribe variances to 
treatment comparisons obtained from the ordinary treatment averages, because 
we have shown (see Table 18.8) that the error sum of squares, SS( I \ 0, X p , 
X T ), with (5 ■ — l)(K n — 1) d.f. is not homogeneous if is greater than zero. 

To illustrate the problem we consider the comparison of two treatments. Based 
on the RCBD analysis the difference of the treatment effects is estimated by the 
corresponding treatment means, with estimated variance (2/s) MS(I | 3, X p , X T ). 
Now, from (18.24), 


2 . 


-E[MS(I\0,X p ,X r )] = - 


2 K 


s K n 


E 


6 字 W 


K 


n—l 


K n 


~ Ka l 


= ~ °e _\) (18.28) 

since c (°0 = s{K n ~ x — l)/(K — 1) as (K n ~ l — l)/(K — 1) sets of [ — 1 
d.f. are confounded in each replicate. We now compare (18.28) with the true 
variance of the difference of two treatment means. Suppose that the two treat¬ 
ments occur together in a block in, say, q out of the s replicates. Then, using 
(18.23) and (18.24) the variance is given by 

長 \sa] + (5 - 咖 = 警卜 2 + 0 f)] (18-29) 
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Hence the difference between the expectation of the estimated variance (18.29) 
and the true variance (18.28) is 


2\q K -ll 2 
一 s ~s~ K n ^ 


(18.30) 


which can be either positive or negative: If (18.30) is positive, we overestimate the 
variance; if (18.30) is negative, we underestimate the variance. As an illustration 
we consider again Example 18.1. 


Example 18.1 (Continued) It follows from Table 18.4 that there are now 
two types of comparisons with either q = 0 or q = 1. We find for q = 0: (18.30) 
equals —^gcr^<0,q = l: (18.30) equals | \ ~ > 0, so that one vari¬ 
ance is underestimated and the other variance is overestimated. □ 


In practical situations the decision whether to analyze a lattice design as an 
incomplete block design or as an RCBD is often based on the efficiency of 
lattice designs relative to the RCBD. In light of what has been said above this, 
too, may be a questionable procedure, since the efficiency is based on (18.28). 
More specifically, the efficiency of a lattice design relative to the RCBD is given 
by the ratio of (18.14) over (18.28), that is, 


E = 


K -~ 1 ~ 1 +K n ~\K-\) 
P 一 1 


s 

s(K-l) ^ 
/ = 1 


rrii 

/ + 0 - i)p - 1 


(18.31) 


where the numerator is obtained by rewriting (18.28) in terms of p~ l and w. 
For different values of p and various lattice designs Kempthome (1952) has 
tabulated E. As is to be expected, E increases quite rapidly with increasing p, 
which would indicate that in most situations an RCB analysis is not warranted. 
In a practical situation this question may be decided bousing p in (18.31) and 
computing E, say, and do a RCBD analysis if 1.00 < £ < 1.05. 


18.7 LATTICE DESIGNS AS PARTIALLY BALANCED 
INCOMPLETE BLOCK DESIGNS 

In the preceding sections we have discussed one-restrictional lattice designs with 
K n treatments in blocks of size K as they relate to factorial experiments. This 
discussion presupposes that K is a. prime or a prime power although we did not 
mention this explicitly. We know, however, from Table 18.3 that certain lattice 
designs exist even if K is not a prime or prime power. For these cases it is useful 
to know that lattice designs are PBIB designs and hence can be constructed and 
analyzed as such. 
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Let us consider the one-restrictional two-dimensional lattice designs. We can 
arrange the K 2 treatments in a K x K array. A simple lattice can then be con¬ 
structed by taking the rows of this array as the blocks for one replicate and the 
columns of the array as the blocks for the second replicate. We can check easily 
that this results in a PBIB(2) design with a Latin square association scheme, 
that is, a L 2 -PBIB( 2 ) design (see Section 4.6.3). The relevant parameters of 
this design are t = K 2 , r = 2, k = K，b = 2K, = 1, A 2 = 0. Accordingly, 

two treatments are compared with variance V\, say, if they occur together in 
the same block, and with variance V 2 , say, if they do not occur together in the 
same block. The reader should check that this agrees with the result obtained 
from (18.13). 

Using the same K x K array we can obtain triple, quadruple, and so forth 
lattices by superimposing one, two, and so on mutually orthogonal Latin squares 
(if they exist), respectively. Blocks of size K are then formed by the rows, the 
columns, and the letters of each of the superimposed MOLS, respectively. Thus 
the triple lattice represents an L3-PBIB(2) design with parameters t = K 2 , r = 3, 
k = K，b = 3K, X\ = 1 ， 入 2 = 0; the quadruple lattice an [4_PBIB(2) design with 
parameters t = K 2 , r = 4,k = K，b = 4K, = 1, ^2 = 0; in general the /-tuple 

lattice an L/-PBIB(2) design (see Section 4.6.3) with parameters t = K 2 , r = i, 
k = K, b = iK, = 1, X 2 = 0. 

The triple three-dimensional lattice with t = K 3 treatments in blocks of size 
K represents a PBIB(3) design with the cubic association scheme (Section 4.6.8). 
Following the geometric interpretation of that association scheme and denoting 
the three axes by x, y, and z, respectively, the arrangement in blocks is such that 
for the first replicate the treatments in a block lie on the same x axis, that is, are of 
the form {(x, y, z): x = 0, 1 ， ... ，尺 一 1} for fixed j and z (y, z = 0, l,..., K — 
1 )，for the second replicate the treatments in a block lie on the same y axis, that 
is, are of the form {(x, y, z): ;y = 0 , 1 ， ... ， ^ — 1 } for fixed x and z(x, z = 
0, l,..., K — l), and for the third replicate the treatments in a block lie on the 
same z axis, that is, are of the form {(x, y, z): z = 0 , 1 ， — 1 } for fixed 
x and y (x, y = 0, l,..., K — 1). For an illustration we refer to Example 18.4 
(with an appropriate renumbering of the replicates). It follows then immediately 
that this design has the parameters t = K 2 , r = 3, k = K, b = 3K 2 , 入 1 = 1, 
入 2 = 0 , 入 3 = 0 . 


18.8 LATTICE DESIGNS WITH BLOCKS OF SIZE K l 

The most commonly used lattice designs are those for K n treatments in blocks 
of size K. There are, however, situations where blocks of size K l (1 < i < n) 
are more appropriate. There are no difficulties extending the previously discussed 
methods to this case: 

1. If n is a multiple of l, say n = ml, then we can represent the experiment 
as one of m factors with K l levels and employ the methods described in 



TWO-RESTRICTIONAL LATTICES 


671 


Sections 18.3 and 18.4 [using GF(^T^) arithmetic for K prime as discussed 
in Appendix A or Section 18.7]. 

2. If n is not a multiple of l, then we employ appropriate systems of con¬ 
founding as presented in Chapter 11 for K prime. The practical problem 
is to choose out of all possible systems of confounding a system for each 
replicate such that the confounding is distributed as evenly as possible 
over all the effects and interactions and such that no effect or interaction 
is completely confounded over the entire design. The second requirement 
is necessary for the design to be connected with respect to the intrablock 
analysis. For more details and examples of practically useful designs the 
reader is referred to Kempthorne and Federer (1948b). 


18.9 TWO-RESTRICTIONAL LATTICES 

In Section 9.8 we considered the problem of double confounding in a 2 n factorial 
experiment as a device for eliminating heterogeneity in two directions using 
incomplete blocks. This principle can be extended to other factorial experiments 
and can be applied also to construct two-restrictional two-dimensional lattices, 
which are also referred to as lattice squares and lattice rectangles (Yates, 1940b). 
They represent special types of resolvable row-column designs (see Section 6.6). 

We shall consider designs for K 2 treatments arranged in K rows and K 
columns. To construct suitable arrangements we distinguish two cases: (1) K 
prime (or prime power) and (2) K not prime. 


18.9.1 Lattice Squares with K Prime 

For K prime (or prime power) we can use the correspondence between the 
t = K 2 treatments and the treatment combinations of the K 2 factorial experiment 
to partially confound any or all of the K l effects and interactions, E a ，with 
rows or columns in s squares of order K. If s = K l, we confound each 
E 01 once with rows and once with columns yielding a balanced lattice square. 
If s = (K 1)/2, we confound each E a once with rows or columns giving a 
semibalanced lattice square. If 2 < ^ < (^ + 1)/2 we confound any E 01 at most 
once with rows or columns, and if + 1)/2 < 5 < K, we try to confound the 
E a as equally as possible with rows and columns. 

The analysis of lattice squares is based on the linear model 


y = /x3 H - X s s H - X r r -|- X c c H - 1C-\- € (18.32) 

where s, r, and c represent the effects of squares (replicates), rows within squares, 
and columns within squares, respectively. For the combined analysis r and c are 
considered to be vectors of random variables with E(r) = 0, var(r) = /a r 2 , 
E(c) = 0 , var(c) = Ia^. 
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Following the notation of Section 18.4, we can write out the intrablock and 
the combined estimators for a linear contrast, ^2 cm of treatment effects as 
in (18.10) and (18.11) ， respectively. In particular, the combined estimator — 
r'k' for a simple treatment comparison, tk — is given by (18.12). To find 
var(?^ — T'k') we denote by c r (yi) and c c (yi) the number of squares in which 
E y ^ e £2 is confounded with rows and columns, respectively, with c r (yi) + 
c c (yi) = c(yi) being the number of squares in which E ye is confounded in the 
design. Further, let 


w r 


+ Ka} 


1 




+ Ka c 


(18.33) 


denote the reciprocals of the interrow and intercolumn variances, and 


w w 

Pr = - Pc = - 

U) r U) c 


We can then extend (18.13) easily and write 


var [% -%] = var [r"(x) - f(z)] 


2crj 


XI [(^(y^) + c r(yt)p7 x + C c{yi)pc l ) K 

q(x,z) 


+ J2 [sKrl (18.34) 


We shall consider (18.34) for the special case of a balanced lattice square. 
Recall (see Table 18.3) that for a balanced lattice square s = [ + 1 such that 
each effect and interaction is confounded once with rows and once with columns, 
which implies that £3 = 0 , £ 2 = ^ c r (y £ ) = 1 = c c (y £ ), u(y^) = [ — 1 for all 
E yi e £ 2 , z) consists of K interactions for each (x, z). It follows then from 
(18.34) that 

var [?(x) - ?(z)l = - ^― ； - r (18.35) 

for any two treatments (x, z), as one would expect for a balanced design. Obvi¬ 
ously, (18.35) is also the average variance of all such treatment comparisons, 
which is an extension of (18.14) for this special case. 

In order to complete the actual analysis, we still have to estimate w, w r , 
and w c . This is easily accomplished with the help of ANOVA tables as given 
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Table 18.9 T\R, C-ANOVA for Lattice Square 


Source 


d.f. 

x s \o 

5 - 1 


Xr 1 0, X s 

s(K - 1) 


x c \o,x s 

s(K - 1) 


X z \3,X s ,X r ,X c 

K 2 - 1 


I\D,X s ,X r ,X c ,X r 

(K- l)[s(K 

— 1) — (^ + 1)] = Vr 

Total 

sK 2 - 1 



Table 18.10 R | C, T- and C | R, T-ANOVA for Lattice Square 

Source 

d.f. 

Source 


s — l 


1 0, X s 

K 2 - 1) 

x T 1 J, X s 

X C \J,X S ,X T 

s(K - 1) 

X r \3,X s ,X z 

j(^)cc VS - (扣 U 


\(E yt ) cr vs- (^)„ r ) 

x r \o,x s ,x c ,x T 

s(K - 1) 

X c \3,X s ,X r ,X r 

{« vs. ㈣ } 


j(^) cc vs. (^)) 

I\3,X Si X ri X c ,X r 


I\3,X s ,X r ,X c ,X z 


in Tables 18.9 and 18.10. In Table 18.9, the treatment-after-rows-and-columns 
ANOVA or TI R, C-ANOVA, the sums of squares are obtained in the familiar 
fashion, where the components of 

SS(X T |a, X x ,X r , X c ) = SS(A) + SS(B) + SS(AB) + ... + 
are obtained from the intrablock analysis (see Section 11.10) and 
SS(Residual) = SS(I \ X s , X r , X c , X T ) 
is obtained by subtraction. 

The row-after-treatments-and-columns or R | C, T-ANOVA and the column- 
after-treatments-and-rows or C | R, T-ANOVA, as outlined in Table 18.10，are 
used to estimate w r and w c , respectively. In order to do so it is useful to com¬ 
ment briefly on the form of some of the sums of squares in Table 18.10. For 
Ey^ g £2 let u r {y t ) = s — c r (y^) and u c (y^ = s — c c (y £ ) denote the number 
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of squares in which E Yi is unconfounded with rows and columns, respectively, 
and let [E y ^ cc , [E y ^ cr , [E y ^ uc , [E y 8 l ^ ur be the estimate of E y ^ obtained from 
those squares in which E ye is confounded with columns, confounded with rows, 
unconfounded with columns, unconfounded with rows, respectively. It is then 
easy to verify that, in Table 18.10, 


SS(X C |3 ， H) = 

= ss[{« 

vs- ㈣ J] 

(18.36) 

ss(mm) = 

= ss[{(^) cr 

vs. ( 則 }] 

(18.37) 

SS(X,.|3 ， H) = 

= ss[{(^) cr 

vs. ㈣ 』 

(18.38) 

SS(X c |3 ， m) = 

= ss[{(^) cc 

vs. ( 則 }] 

(18.39) 


The RHS of (18.36)-(18.39) can be computed following (11.93) with the proper 
modifications, that is, computing the quantities X y ^ 1 from the squares as indicated 
in each expression and then modifying c(yi) and u{y accordingly [this includes 
replacing s by c r {y t ) + u{y t ) in (18.38) and by c c (y t ) + u{y t ) in (18.39)]. With 
regard to estimating w r and w c , the sums of squares (18.38) and (18.39) and their 
expected values are needed. Again, these are easily obtained from (11.94) as 


where 


E [MS(X r I H ， X c ， X x )] = a] + A r Ka} (18.40) 


A,. 


E 

^2,r 


<yd 

(u{y + c r (y t ) 


(18.41) 


and denotes summation over all E y ^ e £ 2 , which are confounded with 

rows, and 


where 


E [MS(X c |3,Z s ,Z r , Z r )] = a] + A c Ka] 


A c 


E 

^2,c 


<Yi) 

{u{y-\-c c (y t ) 


(18.42) 

(18.43) 


and denotes summation over all E y ^ e S 2 , which are confounded with 

columns. We then obtain 


i = MS(I\0.X s ,X r ,X c ,X r ) 


(18.44) 
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from (18.40) and (18.44) 

=(i - 去 ) Ms(m m) 

△r 

and from (18.42) and (18.44) 

of + K 贫 = 去 =(1 - 去 ) MS (/ \0,X s ,X r ,X c ,X r ) 

△c 

We illustrate the above by the following example. 

Example 18.2 Suppose we have t = 5 2 treatments in an unbalanced lattice 
square with 5 = 4 squares. We can use the following system of confounding: 


a^ + K^ = ^ - 




Square 



I 

II 

III 

IV 

Rows 

A 

B 

AB 3 

AB 

Columns 

AB 

AB 2 

AB 4 

AB 3 


The experimental plan (apart from randomization of rows and columns) is then 
as given in Table 18.11, and the relevant parts of the R | C, T-ANOVA and the 
C I R, T-ANOVA are given in Tables 18.12 and 18.13, respectively. Concerning 
treatment comparisons such as, for example, (0, 0) versus (1,1)，we have 


r(0, 0) — r(l, 1 ) = ( Aq — Ai) + 


Bo - Bij 


+ 


( 爲 0 - ab 2 ) + (ab^- abI) + ( 歲 - 項 ) 


with [see (18.34)] 


var (?(0, 0) - ?(1, 1)) 


b + p,: 1 2 + p~ l +p-' 3 + p- 1 


□ 


18.9.2 Lattice Squares for General K 

We now turn to the case where K is not prime or prime power. Suppose there 
exist q MOLS of order K. Then the row and column classification and the 
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Table 18.11 Experimental Plan for Lattice 
Square with 25 Treatments in 4 Squares 


(0,0) 

(0, 1) 

Square I 

(0,2) 

(0, 3) 

(0, 4) 

(1,4) 

(1,0) 

(1, 1) 

(1,2) 

(1,3) 

(2, 3) 

(2, 4) 

(2, 0) 

(2, 1) 

(2, 2) 

(3, 2) 

(3, 3) 

(3,4) 

(3, 0) 

(3, 1) 

(4, 1) 

(4, 2) 

(4, 3) 

(4, 4) 

(4, 0) 

(0, 0) 

(1,0) 

Square II 

(2, 0) 

(3, 0) 

(4,0) 

(1,2) 

(2, 2) 

(3, 2) 

(4, 2) 

(0, 2) 

(2, 4) 

(3, 4) 

(4, 4) 

(0, 4) 

(1,4) 

(3, 1) 

(4, 1) 

(0, 1) 

(1, 1) 

(2, 1) 

(4, 3) 

(0, 3) 

(1,3) 

(2, 3) 

(3,3) 

(0,0) 

(1,3) 

Square III 

(2, 1) 

(3, 4) 

(4, 2) 

(1 ， 1) 

(2, 4) 

(3, 2) 

(4, 0) 

(0, 3) 

(2, 2) 

(3, 0) 

(4, 3) 

(0, 1) 

(1,4) 

(3, 3) 

(4, 1) 

(0, 4) 

(1,2) 

(2, 0) 

(4, 4) 

(0, 2) 

(1,0) 

(2, 3) 

(3, 1) 

(0,0) 

(1,4) 

Square IV 

(2, 3) 

(3, 2) 

(4, 1) 

(1,3) 

(2, 2) 

(3, 1) 

(4, 0) 

(0, 4) 

(2, 1) 

(3, 0) 

(4, 4) 

(0,3) 

(1,2) 

(3, 4) 

(4, 3) 

(0,2) 

(1, 1) 

(2, 0) 

(4, 2) 

(0, 1) 

(1,0) 

(2, 4) 

(3,3) 


classifications with respect to the q languages, say L\, L 2 ,..., L q , can be used in 
pairs to construct various numbers of squares, that is, if we denote the orthogonal 
classifications by R = L^+i, C = Z^+ 2 , L\,..., L q , respectively, then any pair 
(L“ Lj) with i ♦ j (i, j = 1, 2,..., ^ + 2) defines a suitable square arrangement 
of the t = K 2 treatments, where the levels of L/ determine the rows and the levels 
of Lj determine the columns of the square. 


Example 18.3 For ^ = 6 we have q = \. A suitable arrangement would be 
as follows: 




Square 



I 

II 

III 

Rows 

Li 

Lx 

L3 

Columns 

L 3 

l 2 

Li 


□ 
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Table 18.12 R|C, T-ANOVA for Example 18.2 


Source 

d.f. 

£(MS) 

x s \o 

3 


Xr\0,X s 

24 


X C \J,X S ,X T 

16 


A5i vs. A Bn,m,iv 

4 


A5u vs. ABj m IV 

4 


AB lv vs. AB l li m 

4 


AB m vs. A5 i n IV 

4 


x r \o,x s ,x c ,x T 

16 

^ + J2 ^ G r 

M vs. Aii, hi, iv 

4 

〜 2 + I 5cr r 

石 ii vs. 5i，m,iv 

4 

〜 2 + I 5or r 2 

Ab\ iy vs. AB^h 

4 

a e + \ 5or r 2 

AB\\ vs. A Bum 

4 

+ f 5or r 2 

I\0,X s ,X ri X Ci X T 

40 



Table 18.13 C | R, T-ANOVA for Example 18.3 


Source 

d.f. 

£(MS) 


3 



24 


x r \o,x s ,x T 

16 


M vs. An, hi, iv 

4 


Bn vs. 5i，m，iv 

4 


AB m vs. AB i n IV 

4 


ABiv vs. ABijijh 

4 


x c \o,x s ,x r ,x T 

16 

a e + 12 

vs. A5n ,hi 

4 

cr, + | 5cr c 2 

A5u vs. ABj m IV 

4 

〜 2 + i 5or c 2 

AB m vs. AB i n IV 

4 

+ I 5cf c 

- ^ 3 —*~^3 

AB lv vs. AB l u 

4 

°e + | 5cr c 

I\0,X s ,X r ,X c ,X T 

40 

〜 2 
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Example 18.4 

be as follows: 


For 尤 =10 we have q = 2. A suitable arrangement would 




Square 



I 

II 

III 

IV 

Rows 

u 

乙 2 

l 3 

l 4 

Columns 

l 2 

乙 3 

u 

Li 


Other arrangements are, of course, possible. 


□ 


18.10 LATTICE RECTANGLES 

The concept of the lattice square as a design that enables elimination of het¬ 
erogeneity in two directions can be extended to situations where the rows and 
columns do not contain the same number of experimental units. The notion of 
double confounding presented in Section 9.8 may be used to develop designs for 
K n treatments, K being prime or prime power, with replicates having a rect¬ 
angular pattern with rows of size K r and columns of size K c with r -\- c = n. 
There are, obviously, many possible configurations. To describe the general case 
leads to rather complicated notation. For this reason we shall illustrate this type 
of design by the following example. 

Example 18.5 Suppose we have t = 3^ = 21 treatments and replicates with 
rows of size 3 and columns of size 9. A possible system of row and column 
confounding would be as follows: 


Replicate I: 

Rows: 

A,B,AB,AB 2 


Columns: 

ABC 

Replicate II: 

Rows: 

A,C, AC, AC 2 


Columns: 

AB 2 C 

Replicate III: 

Rows: 

B,C,BC,BC 2 


Columns: 

ABC 2 


This leaves each effect or interaction unconfounded in at least one replicate. □ 

With proper modifications, the analysis proceeds as outlined in Section 18.9 
using model (18.32) and the weights 

1 1 

a} + K r o} o} + K c a c 2 

Estimation of the weights is accomplished easily by following the procedures 
outlined in Section 11.12 in combination with (18.41) and (18.43). We merely 
illustrate this for Example 18.5. 
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Table 18.14 Partial ANOVA for Lattice Rectangle 
of Example 18.5 


Source 

d.f. 

E(MS) 

x r \o,x s ,x c ,x T 

24 , 

a e + | 3or r 2 

Mi,in vs. Am 

2 


Bun vs. B\\ 

2 

a l + 5 3or r 2 

Cii,iii vs. Ci 

2 


vs. A^iijn 

2 


^45! vs. AB ll ul 

2 


ACn vs. ACijn 

2 


ACh vs. AC i m 

2 

+ | 3 (T ； 

BC\n vs. BCm 

2 


BC m vs. BC I U 

2 


A\ vs. An 

2 


B\ vs. Bui 

2 

+ 3ff r 2 

Cn vs. Cm 

2 


X c \3,X s ,X r ,X r 

6 

^ 2 + |9^ 2 

ABC\ vs. ABCii in 

2 


AB^Cu vs. AB^Ciju 

2 

a} + | 9or f 2 

一 * — 2 -—— 2 



ABC m vs. ABC l u 

2 



Example 18.5 (Continued) The partitioning of SS(Z r | 3, X s , X c , X T ) and 
SS(Z C I 3, X s , X r , X r ) is indicated in Table 18.14 together with the associated 
£"(MS). With these, w r and w c can be estimated in the usual way. □ 


18.11 RECTANGULAR LATTICES 

The class of two-dimensional lattices is applicable when the number of treatments 
is a perfect square; that is, ? = K 2 . Obviously, this limits the number of cases in 
which such a lattice design can be used. To remedy this deficiency Harshbarger 
(1947, 1949 ， 1951) introduced designs for t = K(K — 1) treatments in blocks 
of size K — l, which are referred to as rectangular lattice designs. Actually, 
this is a special case of a more general class of designs for t = K(K — L) 
treatments in blocks of size K — L (K > L). However, only the case L = 1 is 
really useful from a practical point of view since (K — l) 2 < K(K — 1) < K\ 
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thus accommodating intermediate values of t. In contrast to the lattice designs 
discussed so far there does not now exist a correspondence between the treatments 
and factorial combinations, a fact that will be reflected in the analysis of these 
designs. 

18.11.1 Simple Rectangular Lattices 

We denote the treatments by ordered pairs (x, y) with x,y = 1,2,..., K, x ^ y. 
We then form two replicates, each with K blocks of size K — l sls follows: 

Replicate I: Treatments with the same value for x form the xth block 
Replicate II: Treatments with the same value for y for the yth block 

Thus each treatment (x, y) appears together in a block with 2(K — 2) treat¬ 
ments all of which have one digit in common with (x, y). The remaining 
K 2 — 3K + 3 treatments have either 0, 1, or 2 digits in common with (x, y). 
Nair (1951) has shown that these groups of treatments define an association 
scheme for a PBIB(4) design for > 4 with the actual resolved PBIB(4) design 
as given above with t = K(K — 1), k = K — l, r = 2, b = 2K. 

More formally, the association scheme and the parameters of the design (see 
Section 4.3) are as follows: Two treatments (x, y) and (x\ 〆）are said to be 

1st associates if (x = x\ y ^ y r ) or (y = y f , x ^ x f ) 

2nd associates if x ^ x\ x ^ y\ y ^ x f , y ^ y r 
3rd associates if (x = y\ y x f ) or (y = x\ x ^ y f ) 

4th associates if x = y\ y = x' 

Thus n x = 2(K - 2), n 2 = (K - 2)(K - 3), n 3 = 2(K - 2), n 4 = 1 with A，i = 

1, 入 2 = 入3 = 入4 = 0. 

Furthermore 

"0 1 0 0 0 

1 K-3 K-3 1 0 

Pi = 0 K-3 (K-3)(K -4) K-3 0 
0 1 K-3 K-3 1 

_0 0 0 1 0 

"0 0 1 0 0 

0 2 2(K - 4) 2 0 

P 2 = 1 2(K - 4) (^" -4 )(^： -5) 2(K - 4) 1 

0 2 2(K - 4) 2 0 

0 0 1 0 0 
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"0 

0 

0 

1 0~ 




0 

1 

K-3 

K-3 1 



尸 3 = 

0 K-3 

(K -3)(K -4) 

K-3 0 




1 K-3 

K-3 

1 0 




Lo 

1 

0 

0 0_ 




"0 

0 

0 

0 


r 



0 

0 

0 

2(K - 2) 

0 


尸 4 = 

0 

0 

(K -2)(K - 

3) 0 


0 



0 2(K - 

2) 0 

0 


0 



1 

0 

0 

0 


0 



For K = 3 this design reduces to a PBIB(3) design since then = 0. 
The analysis of this design proceeds as outlined in Section 4.5. 


18.11.2 Triple Rectangular Lattices 

We now denote the treatments by triplets (x, y, z) with x,y, z = 1,2,K and 
x ♦ y ♦ x. The triplets are chosen in the following way. We take a Latin 
square of order K, replacing the Latin letters A, C,... by the “Latin” numbers 
1, 2, 3, … ， respectively, arranged in such a way that the diagonal contains the 
numbers 1, 2,…， [.Then leaving out the diagonal the remaining K(K — 1) 
cells are identified by the row number, x, the column number, y, and the “Latin” 
number, z. Each such cell corresponds to a treatment (x, y, z), and the treatments, 
which are allocated to the blocks in the three replicates as follows: 


Replicate I: Treatments with the same x value form the xth block 

Replicate II: Treatments with the same y value form the yth block 

Replicate III: Treatments with the same z value form the zth block 


giving a resolved design with parameters t = K(K — 1), k = K — l, b = 
3K, r = 3. 

Nair (1951) has shown that for 尤 = 3 and 欠 = 4 the resulting design is a 
PBIB design, but that this is no longer true for K > 5. For [ = 3 we have a 
PBIB(2) design with the following association scheme: Two treatments (x, y, z) 
and (x\ y\ z f ) are said to be 1st associates if x = x f ox y = y r or z = z f and 2nd 
associates otherwise. It then follows that n\ = 3, 行 2 = 2, 入 i = 1, 入 2 = 0 and 

/0 10 、 /0 0 1\ 

Pi = 1 0 2 P 2 = 0 3 0 

\0 2 0/ \1 0 1/ 
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For K = 4, the association is as follows: Two treatments (x, y, z) and (x\ y\ z r ) 
are said to be 1 st associates ii x = x f or y = y r or z = z f , 2 nd associates if they 
have three digits alike, and 3rd associates otherwise. This leads to a PBIB(3) 
design with parameters n\ = 6 , 沒 2 = 2, 行 3 = 3, 入 i = 1, 入 2 = 入3 = 0 and 


/o 

1 

0 

o\ 


(o 

0 

1 

o\ 


(o 

0 

0 

1\ 

1 

2 

1 

2 

尸 2 = 

0 

3 

0 

3 

尸 3 = 

0 

4 

2 

0 

0 

1 

0 

1 

1 

0 

1 

0 

0 

2 

0 

0 

V° 

2 

1 

V 



3 

0 

V 



0 

0 

V 


The analysis of these triple rectangular lattices follows the procedures given in 
Section 4.5. For K > 5 these designs can be analyzed according to the methods 
developed in Chapter 1 for the general incomplete block design. It should be 
mentioned, however, that for [ = 5 treatments are compared with five different 
variances depending whether they ( 1 ) appear together in the same block and 
have either 1 ， 2， or 3 digits alike or (2) do not appear in the same block and 
have 1 or 2 digits alike. It appears then that this design has a structure that 
is more general than that of a PBIB design (see also Nair, 1951) but that is 
unknown. 

The method used to construct triple rectangular lattices can be generalized 
to construct rectangular lattices with more than three replicates by using several 
MOLS (where available) to label the treatments appropriately. For K — 2 MOLS 
(if they exist) we obtain the near balance rectangular lattices with K replicates 
(Harshbarger, 1951). 


18.12 EFFICIENCY FACTORS 

It should be clear from our discussion in the preceding sections that the one- 
restrictional lattice designs are certain types of incomplete block designs, and 
that the two-restrictional lattic designs are resolvable row-column designs. For 
both of these types of designs we have discussed in Sections 1.12 and 6.6.7, 
respectively, how comparisons among competing designs can be made by com¬ 
puting their efficiency factors, and by providing upper bounds for the efficiency 
factors. These results can, of course, be used here, too. 

More specifically, however, Patterson and Williams (1976) give the efficiency 
factor for a square lattice as 


E _ (K 2 - 1)(^ - 1) 

_ (/: 2 - 1)(5- 1) + ^(^- 1) 


(18.45) 


where s is the number of replicates or number of different systems of con¬ 
founding used. For other one-restrictional lattices (18.45) represents an upper 
bound. 
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For a two-restrictional lattice John and Williams (1995) give an upper bound 

t _ p _ v + 1 

E py = — ； (18.46) 

t _ 丄 

where t = number of treatments, p = number of rows, and y = number of 
columns. Alternatives to (18.46) were derived by John and Street (1992, 1993). 



CHAPTER 19 


Crossover Designs 


19.1 INTRODUCTION 

We have mentioned earlier (e.g., 1.9.1) that a very important aspect of experi¬ 
mental design is the reduction of error in order to improve the inference from 
experiments. This is particularly true if an experiment involves biological entities, 
such as animals or humans, since such entities typically exhibit rather larger vari¬ 
ability. One way to reduce this natural variability is to use each animal or human, 
which we shall refer to as subjects, as a block rather than as an experimental 
unit. Different treatments are then applied successively, that is, in different time 
periods, to each subject, so that each subject-period combination now represents 
the experimental unit. This is often referred to as each subject acting as its own 
control, since now comparisons between treatments can be made within rather 
than between subjects. 

Obviously, for this procedure to be of any value certain conditions have to be 
fulfilled: (1) a subject reacts to the treatment soon after the treatment has been 
applied, (2) the treatment effect only lasts for a limited time, (3) after this time the 
subject is restored to its original state, and (4) the treatment effects are the same 
in each period. If these conditions are satisfied, we may use some form of block 
design where the subjects are the blocks and the treatments to be administered 
to the subject are applied at random. If, however, period effects are suspected, 
that is, the subjects change systematically over the time of the trial, then we 
employ some sort of row-column design, with rows representing the periods 
and columns representing the subjects. This situation may occur, for example, 
in a dairy cattle feeding trial, where the treatments are applied during the cow’s 
lactation period and where it is known that changes occur during the lactation 
period regardless of the treatments (Cochran, Autrey, and Cannon, 1941). 

It is this latter type of situation that we are concerned with in this chapter. The 
designs suitable for these situations are called crossover designs or changeover 
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designs, and we shall discuss some of their features in the following sections. 
We should mention that these designs are sometimes referred to as repeated 
measurements designs, but we shall reserve this term for designs where repeated 
measurements are obtained on a subject following a single treatment application 
(see 1.13.7). ^ 

Crossover designs were first considered in principle in the context of agricul¬ 
tural experiments. Cochran (1939), for example, alluded to their special features 
in connection with rotation experiments. Further developments came with appli¬ 
cations in animal feeding trials (e.g., Lucas, 1957; Patterson, 1951; Patterson and 
Lucas, 1959, 1962), biological assay (e.g., Finney, 1956), pharmaceutical and 
clinical trials (e.g., Senn, 1993)，psychology (e.g., Cotton, 1998), and industrial 
research (e.g., Raghavarao, 1990). For a brief history of this subject see Hedayat 
and Afsarinejad (1975). 


19.2 RESIDUAL EFFECTS 

We have pointed out above that one of the advantages of crossover designs is 
that certain treatment contrasts may be estimated more precisely on a within- 
subject basis as compared to designs where only between-subject information 
is available. There are, however, also disadvantages, such as the length of time 
required for a trial. 

Another major disadvantage may arise if the treatments exhibit effects beyond 
the period in which they are applied. These lingering effects are referred to as 
residual effects or carryover effects, these effects cannot be accounted for, 
they may bias the estimates of contrasts among treatment effects or, as they also 
are referred to, direct effects. Wash-out periods (with either no treatment or a 
standard treatment) have been used to eliminate this problem, but that may in 
some cases be unethical and it also prolongs the duration of the trial even more. 

There may be situations in which one is interested to estimate the residual 
effects, but generally we are interested only in the direct effects. It is there¬ 
fore important to construct designs that allow us to estimate the direct effects 
separately from the residual effects. 


19.3 THE MODEL 

We have argued before (see 1.2.2) that the development of the experimental design 
and the formulation of an appropriate statistical model are intimately connected 
in that the structures of the treatment design, the error control design, and the 
sampling and observation design determine essentially the complexity of the 
statistical model for purposes of analyzing the data. From our discussion, so far it 
is clear that a model for crossover designs follows that for a row-column design, 
that is, it contains period, subject, and treatment (direct) effects. In addition, 
however, we also need to include residual effects, and that requires that we have 
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to make assumptions about the nature of the residual effects. Over how many 
periods do they extend? Do they interact with the treatments applied in these 
periods? Do they change over time? 

The commonly used assumptions are that the carryover effects last only for one 
period and that they are constant over time and do not depend on the treatment 
applied in that period (but see also Section 19.8.6). Thus, if we denote by yij 
the observation in period i (/ = 1, 2,..., p) on subject j (j = 1,2,..., n), we 
write the model as 


yij = M + 兀 / + 心 + ^ij (19.1) 

where 7T/ represents the ith period effect, Pj the yth subject effect, Td(i,j) the treat¬ 
ment effect, with d(i, j) denoting the treatment applied to subject j in period 
i using the design d, Pd(i-\,j) the carryover effect associated with the treat¬ 
ment assigned to subject j in period / — 1, and e" the error with mean 0 and 
variance cfj. 

We shall rewrite (19.1) in matrix notation as follows (see Stufken, 1996): 


y = 3/x -\- X\Tt -\- X 2 P + + XdAP + e (19.2) 


with y f = (yn,y2U . • •, y P n), ^ = On, 兀 2, • • • ，兀尸 ) ， 〆 =(H •.., Pn), 

l = (T]_，T 2 ，. • . ， P — (Pi，P2，•. . ， Pr)? 





1 

Op . 

.. 0/ 




0 P . 

.. 0 P 

X\ = 

-Ip- 

= 0 x I p , X 2 = 

_0 P 

Op . 



and the pn x t design-dependent matrices as 



"^3l" 


^d4\ 


Xd3 = 

X d32 

Xd4 = 

^d42 

(19.3) 


_^d3n_ 


_^dAn_ 



where the p x t matrix Xd 3 j denotes the period - treatment incidence matrix for 
subject j and where XdAj = L*Xd 3 j denotes the p x p period-residual effect 
incidence matrix for subject j, with the p x p matrix L* defined as 



^ p—l 0/7—1 


The form of L* in (19.4) implies that Pd(0,j) = 0. 


(19.4) 
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19.4 PROPERTIES OF CROSSOVER DESIGNS 

We have seen in our earlier discussions that balancedness plays an important and 
useful role in the construction of designs. It is therefore not surprising — in fact, 
it is quite intuitive — that “good” crossover designs benefit from certain forms of 
balancedness. 

We shall denote a crossover design d with t treatments, n subjects, and p 
periods by CO(t, n, p) and the totality of all CO(f, n, p) by I n order 

to describe balancedness more precisely we then give the following definitions, 
following Cheng and Wu (1980) and Afsarinejad (1990). 

Definition 19.1 A design d e ^2t,n,p is said to be uniform on the periods if 


each treatment is assigned to 入 i subjects in each period. □ 

Definition 19.2 A design d e ^lt,n,p is said to be uniform on the subjects if 
each treatment is assigned 入 2 times to each subject. □ 

Definition 19.3 A design d e is said to be uniform if it is uniform on 
the periods and uniform on the subjects. □ 

These definitions imply that 

n = X\t p = X 2 t (19.5) 

and hence n > t, p > t. 


Definition 19.4 A design d e ^t,n,p is said to be balanced for residual effects 
(or balanced for short) if each treatment is immediately preceded by each other 
treatment (but not by itself ) 入 times. □ 

Definition 19.5 A design d e ^t,n,p is said to be strongly balanced for resid¬ 
ual effects (or strongly balanced for short) if each treatment is immediately 
preceded by all the treatments (including itself) 入 * times. □ 

These definitions imply that 

n(p — 1) = Xt(t — 1) n(p — 1) = (19.6) 

For the important case p < t Afsarinejad (1990) gives the following definitions 
for balancedness. 

Definition 19.6 A design d e Qt,n,p with p < t is said to be balanced if it 
is uniform on periods, balanced for residual effects, and distinct treatments are 
applied to the subjects. □ 

Definition 19J A design d e Q t , n , p with p < t is said to be strongly bal¬ 
anced if it is uniform on periods and strongly balanced for residual effects. □ 
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19.5 CONSTRUCTION OF CROSSOVER DESIGNS 

In this section we shall describe some methods of constructing crossover designs. 


19.5.1 Balanced Designs for p = t 

A method of constructing balanced designs for the case p = t was given by 
Williams (1949). These designs are commonly referred to as Williams squares. 
The method can be described easily by developing the treatment sequence 
assigned to subject 1 (Bradley, 1958). We distinguish between t even and t odd. 


1. t even: Let us denote the treatment sequence for subject 1 by (c\, 
C 2 , … ， c t y. Then the sequence for subject y + 1 (j = 1,2,..., t — 1) 
is given by (c\ + j, C 2 + j, … ， c t j) f where each element in this 
sequence is reduced mod t. Now, labeling the treatments 1,2,... ,t, we 
have specifically, (ci, C 2 , …， c t ) f = (1, r, 2, r — 1, 3, ..., t/2, (t + 2)/2)’. 
This yields a CO(?, t, t). 

2. t odd: Here (ci, C 2 , … ， c t )’ = (1, t,2,t — 1, 3,+ 3)/2, (t + 1)/2)’. 
Developing (c\, C 2 ,..., c t ) as in (1) yields t sequences for t subjects. 
We then take the sequence for subject t, that is, (c\ -\-t — l,C 2 ~\-t — 
l,..., c t 1 — l) r and make the mirror image of it the sequence for subject 
t l. The sequences for subjects t 2, t 3,... ,2t are then obtained by 
developing the sequence for subject ? + 1 as described above, always mod 
t. This yields a CO(t, It, t). 


Using arguments based on cyclic groups, one can show that the designs 
obtained by the methods described in (1) and (2) above are balanced crossover 
designs with A. = 1 and A, = 2, respectively, in (19.6). In addition, the CO(^, t, t) 
in (1) is obviously a minimal design since 入 i = 1 in (19.5). The same cannot 
be said for the CO(^, It, t) in (2). In fact, it is known that minimal balanced 
designs do not exist for t = 3, 5, 7 and exist only for some t odd (Denes and 
Keedwell, 1974). 

19.5.2 Balanced Designs ior p < t 

Even for moderate values of t the designs discussed in Section 19.5 may not 
always be practicable because the duration of the trial may be too long. Rather 
than using Latin square designs, as in (1) above, or multiples of Latin square 
designs, as in (2) above, we need designs that have the form of Youden squares 
(see 1.10.5) or multiples of Youden squares, that is, designs with p < t. At the 
same time it is desirable to consider minimal designs, that is, to keep the number 
of subjects as small as possible. 

We know from (19.5) and (19.6) that for a balanced CO(t, n, p) we require n = 
kit and n = kt(t — l)/(p — 1), which implies 入 i(p — 1) = X(t — 1). A minimal 
design is then obtained for 入 =1. We thus have the following theorem. 
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Table 19.1 Generating Columns for Minimal Balanced 
CO(^, n, p) 


1 

2 

3 . 

Column 

.. 入 1 

Cl 

c p 

C2p-1 . 

.. C(/_i)p_(/_2) • 

• • c (Xi —1)/7—(A ， i—2) 

Cl 

c p+i 

C2p . 


• . c (Xi-1)/7-(Ai-3) 

c p 

C2p-l 

C?,p-2 . 

• • Qp—(/—1) . 

.• Ct 


Theorem 19.1 (Afsarinejad, 1990) The necessary and sufficient conditions 
for the existence of a minimal balanced CO(?, n, p) with p < f are « = 入 J and 


We shall describe here two of several general methods of constructing balanced 
CO(t, n, p). 

Method 1 

Extending the method of Section 19.5.1，Afsarinejad (1983, 1990) proposed 
to construct a minimal balanced CO(f , 入 p) for p < t by developing (mod 
t) each of the 入 i columns of Table 19.1 associated with subjects 1, ，+ 1, 
2, + 1 ,... ，（入 i — 1)，+ 1， where 


(1, t,2,t — 1,3,..., t/2, (t + 2)/2) for t even 


C f = (Cl, C 2 , . . . , c t )= 


(1 ， t , 3, t — 2, 5,... , (t — 3)/2, 

(， + 5)/2, (t + 1)/2, (t + 5)/2, 

(，一 3)/2, … ，l 1) for ? = 4y + 1 


(1 ， t, 3, t — 2, 5,..., (t — 1)/2, 

(t + 3)/2, (t — 1)/2,1) for ? = 4y + 3 


We comment briefly on the form and nature of the columns in Table 19.1: 


1. The columns consist of interlaced increasing and decreasing sequences of 
positive integers; 

2. For t odd the column consists of three parts, say c' = {c f _ v c 。， c^), where c; 
is the mirror image of c f _ v and co = (t 1)/2 or (t + 3)/2 for ? = 4y + 1 
or ? = 4y + 3, respectively; 

3. The set 1J- =2 (q — q_i) contains each nonzero integer mod t. These repre¬ 
sent the p — I successive differences between the elements of each of the 
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入 1 columns [recall that 入 i(p — 1) = ， 一 1]. After development each column 
will lead to (p — l)t such differences, so that each ordered pair of distinct 
treatments will appear exactly once in the design, implying 入 =1. 

We shall illustrate the procedure with the following example. 


Example 19J Let t = 1, p = 4, and hence =2. From Table 19.1 we 
obtain (ci, C 2 ,... ， cj) = (1, 7 ， 3, 5 ， 3, 7 ， 1). Thus the generating columns are 
(1 ， 7, 3, 5)’ and (5, 3, 7, 1)’，and the design is given by 


Subject 


Period 



1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

1 

1 

2 

3 

4 

5 

6 

7 

5 

6 

7 

1 

2 

3 

4 

2 

7 

1 

2 

3 

4 

5 

6 

3 

4 

5 

6 

7 

1 

2 

3 

3 

4 

5 

6 

7 

1 

2 

7 

1 

2 

3 

4 

5 

6 

4 

5 

6 

7 

1 

2 

3 

4 

1 

2 

3 

4 

5 

6 

7 


□ 


Method 2 

This method is due to Patterson (1952) (see also Patterson and Lucas, 1962) and 
is based on balanced incomplete block designs. Suppose we have a BIBD (t, b, 
k, r; a). We then take the treatments in each block as the generating column for 
a balanced CO(k ， k, k) for k even or a CO(k, 2k, k) for k odd as described in 
Section 19.5.1. Adjoining these b crossover designs leads to the final CO{t, bk,k) 
with 入 =a for 众 even or CO(t, 2bk, k) with 入 = 2a for 众 odd. 

We illustrate this procedure with the following example. 


Example 19.2 Consider the BIBD (5, 5, 4, 4; 3) 

Block 

1 2 3 4 5 

1 1112 

2 2 2 3 3 

3 3 4 4 4 

4 5 5 5 5 


Recall that a balanced CO(4, 4, 4) is obtained from the generating column 
(1, 4, 2, 3)’. For a generating column (block) from the BIB design of the form 
(T\, 7*2， T^, T 4 ) we then obtain the CO(4, 4, 4) by using the association T\ = 
1, T 2 = 4, 73 = 2, 7*4 = 3. In other words, in the CO(4, 4, 4) given by 



CONSTRUCTION OF CROSSOVER DESIGNS 


691 


replace i by the associated Tj successively for each block. Part of the final 
CO(5, 20, 4) is given below: 


Subject 


Period 



1 

2 

3 

4 .. 

• 17 

18 

19 

20 

1 

1 

3 

4 

2 

. 2 

4 

5 

3 

2 

2 

1 

3 

4 

- 3 

2 

4 

5 

3 

3 

4 

2 

1 

- 4 

5 

3 

2 

4 

4 

2 

1 

3 .. 

• 5 

3 

2 

4 


□ 


19.5.3 Partially Balanced Designs 

One possible disadvantage of the designs obtained by method 2 in Section 19.5.2 
is that the number of subjects required may be rather large. This is due to the fact 
that in general the number of blocks in a BIB design is relatively large. We have 
seen earlier that one “remedy” is to resort to PBIB designs with a smaller number 
of blocks. The same idea can be used here. Instead of developing the blocks of 
a BIB design as illustrated in Example 19.2, we may choose to develop in the 
same manner the blocks of a PBIB(m) design (Patterson, 1952; Patterson and 
Lucas, 1962). As a consequence, however, comparisons between direct effects 
from the resulting crossover design are estimated with up to m different variances. 
Of particular interest here are PBIB(2) designs but also, because of the ease of 
construction, cyclic PBIB designs (see Section 5.3) as advocated by Davis and 
Hall (1969). For another class of partially balanced designs see Blaisdell and 
Raghavarao (1980). 


19.5.4 Strongly Balanced Designs for p = t + 1 

We have pointed out earlier that the main interest in crossover designs is to esti¬ 
mate comparisons among direct effects. There are, however, situations where the 
residual effects are of interest in themselves as well. To increase the efficiency 
of estimating residual effect contrasts and to achieve orthogonality between the 
direct and residual effects (see Section 19.7), we consider strongly balanced 
designs. The additional important feature of these designs is that each treatment 
is also preceded by itself 入 * times (see Definition 19.5). 

An obvious way to construct a minimal strongly balanced design is to 
consider the balanced designs CO(?, t, t) and CO (，， 2t, t) of Section 19.5.1 and 
add an extra period by simply repeating the treatment assignments from 
period t. We thus create the so-called extra-period designs (Lucas, 1957) 
CO (I ^ + 1) and CO(t, 2t, ? + 1) with A.* = 1 and 2, respectively. Some other 
extra-period designs based on orthogonal Latin squares were given by Lucas 
(1957). i i ' 
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19.5.5 Strongly Balanced Designs for p < t 

Similar to Theorem 19.1 we can state the following theorem. 

Theorem 19.2 (Afsarinejad, 1990) The necessary and sufficient conditions 
for the existence of a minimal strongly balanced CO(t, n, p) with p < t are 
n = X\t and X\(p — l) = t. 


The construction then follows the method described in Section 19.5.2 with the 
following modifications (Afsarinejad, 1983, 1990): 


1. The last element in the 入 ith generating column in Table 19.1 is c ， +i. 

2. The c vector 


(Cl, C2, . . . , C t , C t+ \) 

(l, t,2, t — 1,..,, t/2, (t + 2)/2, (t + 2)/2) for t even 

(l, t, 3, t — 2,..., (t — 3)/2, (t + 5)/2, 

= (r + 1)/2, (t + 1)/2, (t + 5)/2, 

(t — 3)/2, 1) for ? = 4y + 1 

(l, t, 3, t — 2,..., (t — 1)/2, (t + 3)/2, 

(t + 3)/2, (t — 1)/2 ,... ,t,l) for t = 4y -\-3 


3. The set — Q-i) contains each number mod t and, hence, each 

ordered pair of treatments appears exactly once in the design. 


We illustrate this procedure with the following example. 


Example 19.3 Let t = 6, p = A, and hence =2. Then (c\, C 2 , … ， cq)= 
(1 ， 6, 2, 5, 3, 4, 4). The generating columns are (1, 6, 2, 5) and (5, 3, 4, 4) and 
the final design is 


Period 


Subject 

123456789 10 11 

1123456561 2 3 

2612345345 6 1 

3234561456 1 2 


12 

4 

2 

3 
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19.5.6 Balanced Uniform Designs 

Uniform crossover designs (see Definition 19.3) are not only structurally appeal¬ 
ing but also an important role in connection with the consideration of optimality 
(see Section 19.6). It is easy to verify that the CO(t, t, t) and CO(/, 2t, t) given in 
Section 19.5.1 are uniform crossover designs. From these designs we can obtain, 
of course, CO(?, c\t, C 2 t) and CO(t, 2c\t, C 2 t), respectively, with c\, C 2 positive 
integers. 

19.5.7 Strongly Balanced Uniform Designs 

One method of construction is provided through the proof of the following 
theorem. 

Theorem 19.3 (Cheng and Wu, 1980) If n is a multiple of t 2 and p = 入乂 
with 入 2 even, then there exists a strongly balanced uniform CO(t, n, p). 


A simple way to construct such a design (Cheng and Wu, 1980) is to first 
consider the case n = t 2 and 入 2 = 2. For periods 1 and 2 write down all possible 
ordered pairs of the t treatments as a 2 x r 2 array. To each element in this array 
add i mod t(i = 1 , 2 ,...,?— 1 ) to obtain periods 2/ + 1 and 2 i + 2. 

As an illustration we consider the following example. 


Example 19.4 Take t = 3, thus n = 9, p = 6, leading to the following 
CO(3, 9, 6) with A* = 3: ‘ ‘ 


Subject 


Period 



1 

2 

3 

4 

5 

6 

7 

8 

9 

1 

1 

1 

1 

2 

2 

2 

3 

3 

3 

2 

1 

2 

3 

1 

2 

3 

1 

2 

3 

3 

2 

2 

2 

3 

3 

3 

1 

1 

1 

4 

2 

3 

1 

2 

3 

1 

2 

3 

1 

5 

3 

3 

3 

1 

1 

1 

2 

2 

2 

6 

3 

1 

2 

3 

1 

2 

3 

1 

2 


It is then obvious how one can obtain in general a CO(t, ct 2 , 入 2 ，）with c a 
positive integer and 入 2 even. For a different method of construction see Beren- 
blut (1964). □ 


19.5.8 Designs with Two Treatments 

The most widely used crossover designs involve two treatments, A and B say, 
where in the clinical setting A may be a standard drug or placebo and B may be 
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the test drug. And the simplest design is to consider two sequences of treatment 
application to a subject: AB or BA. For two subjects the CO(2, 2, 2) is given by 

Subject 


Period 

It is, of course, obvious by considering the available degrees of freedom that this 
CO(2, 2, 2) will not allow the estimation of all parameters in model (19.1). Even 
replicating each sequence r times resulting in a CO(2, 2r, 2) does not solve this 
problem. Therefore, designs with additional periods and/or additional sequences 
will have to be considered. 

We shall mention some such possible designs by indicating the number of 
sequences ^ and the number of periods p, as well as the sequences themselves in 
a p x s array. The final design is then obtained by replicating each sequence r 
times (we note that equal replication is desirable, but not necessary). The resulting 
design is then a CO(2, sr, p). 

p = 2, s = 4 (Balaam, 1968): 

A B A B 
B A A B 

p = 3, s = 2: 

(i) A B (ii) A B (iii) A B 
BA BA A B 

BA A B B A 

Design (i) is, of course, an extra-period design as discussed in Section 19.5.4. 
Design (ii) is referred to as a switch-back design. 

p = 3, s = 4: 

Combining various sequences from the designs for p = 3, s = 2 leads to 

(i) A B A B (ii) A B A B 
BABA B A A B 

B A A B BABA 

(iii) A B A B 

B A A B 

ABBA 
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p = 3, s = 6: 

Combine all six sequences from (i) ， (ii), (iii) for p = 3, s = 2. 


= 2: 









(i) 

A 

B 

(ii) 

A 

B 

(iii) 

A 

B 


A 

B 


B 

A 


B 

A 


B 

A 


A 

B 


B 

A 


B 

A 


B 

A 


A 

B 

(iv) 

A 

B 

(v) 

A 

B 


A 

B 


B 

A 


A 

B 


B 

A 


A 

B 


B 

A 


B 

A 


A 

B 


A 

B 


B 

A 

(vii) 

A 

B 








A 

B 








A 

B 








B 

A 








Some of these designs may be combined to obtain designs with 夕 = 4 or 夕 = 6 
sequences, for example. 

A detailed discussion of the designs given above can be found in Jones and 
Kenward (2003) and Ratkowsky, Evans, and Alldredge (1993). An extensive 
discussion of 2-treatment designs is also given by Kershner and Federer (1981). 


19.6 OPTIMAL DESIGNS 

In the previous sections we have discussed different classes of crossover designs 
and have given some methods of constructing designs in these classes. Although 
the designs presented here have certain desirable properties, such as balancedness, 
for example, and may be quite appropriate from a practical point of view, they 
may not necessarily be optimal from a statistical point of view. We shall address 
this question briefly in this section and give a few results. 


19.6.1 Information Matrices 

In order to write out the information matrices for r and p [see model (19.2)], 
we shall follow Cheng and Wu (1980) and Stufken (1996) and use the following 
notation: 

n u j = number of times that treatment u is assigned to subject j 
n u j = number of times that treatment u is assigned to subject j in 
the first p — 1 periods 
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number ot times that treatment u appears in period i 
number of times that treatment u is immediately preceded by 


treatment v 


XI n uj = ^ui = number of replications for treatment u 
7=1 ，=i 

n p—\ t 

E n u j = ^2 ^ui = XI m vu = number of replications for 
j=l i=l u=l 

treatment u in the first p — I periods 

p t 

i u i = m uv = number of times that treatment u appears 

i=2 v=l 

in the last p — 1 periods 


We then have that 


r n uj = p ^2 Hu J = p _ i ^ 


r u =np 


= n{p - 1) 


Now we consider (19.2) and assume, without loss of generality, /i = 0 (or, 
equivalently, define 7 t* = /i3 it). Then the coefficient matrix for the normal 
equations for model(19.2) can be written as 


where, dropping the subscript d for convenience and indicating the dimensions 
instead, 


Lt 乂 p = {.^Ui ) 


之 £ 

n - A A _ - _ 

/ J /rj J 

二 £ 5 
£1 # 仏匕 L ; ," < 仏 /Md 


^ ^ ^ ^ 

rl r ,- ^ 




d 心 z , 
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with l u \ =0, i u i = (i > 2), 

N t~Kn = O^uj) ^ txn = ("«_/•) 

D t xt =diag(ri,r 2 , D txt = diag(?i ，？ 2 , •.. ， M txt = (m uv ) 


We then obtain from (19.7) the information matrix for estimating jointly the 
direct and residual effects as 


C 


— 





Cn\ 

C 22 J 


(19.8) 


where 


Cu^D- n~ l LL' - p~ l NN' + n_ l p_ l N N r 

C 12 = C' 2 l =M~ n~ l LL f - p~ l NN f + n~ l p~ l N 0 n 0' n N f 
C 2 2 = D - n~ x LL - p~ l NN + n~ l p~ l N 0„0'„ N 


It then follows from (19.8) that the information matrices, that is, the coeffi¬ 
cient matrices for the reduced normal equations, for estimating the direct effects 
(adjusted for residual effects) and for estimating the residual effects (adjusted for 
direct effects) are given by 


C r = Cn-Cn C^ 2 C 2 i (19.9) 

and 

c p = C 22 - C 2 1 c~ x C 12 (19.10) 

respectively. 


19.6.2 Optimality Results 

As we have seen in Section 1.13 optimality considerations are based on the 
information matrices for competing designs. For some of the classes of designs, 
which we have discussed in Section 19.5, the matrices (19.9) and (19.10) reduce 
to quite simple forms so that optimal designs can be characterized if not with 
respect to Q t ， n , p then with respect to some suitable subset ^2 c 
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For example, for a uniform CO(?, 入 i ，， f) Hedayat and Afsarinejad (1978) 
show that 


and 


C r = X\t I 


Xi(t 2 - t -l) 


MM' + V( 2 — 

t 2 -t - l r 


c p = u r 一广 1 1 


1 . 入 i (2 _ ?) . 


(19.11) 

(19.12) 


with M such that M0 t 0 f t = X\(t — \)3 t 3 f t . Together with the facts that C T 0 t = 0 
and C r is completely symmetric enables them to state the following theorem. 


Theorem 19.4 A design d* in ^ t is universally optimal for the estima¬ 
tion of direct effects if M(J*)= 入 — /). 

Here Q; k ' t t denotes the set of all uniform CO(?, 入 f) and M(J*) denotes 
the M matrix for design d*. The same result holds with respect to estimating 
residual effects. Now we know from Sections 19.5.6 and 19.5.1 that for a bal¬ 
anced design d in ^ Xlt t with 入 i = 1 for even t and 入 i = 2 for odd t we have 
M(d) = (Ot^t — I) and M(d) = 2(3 t 0 f t — /), respectively. Thus patching A，i (or 
入 i/2) copies of a balanced design d in^ t t (or 2t t ) for t even (odd) yields a 
balanced design d in ^^ lt t with M(d)= 入 — J). Even more generally, 
M(d) = Xi(3 t 0[ — /) if and only if d is balanced (Hedayat and Afsarinejad, 
1978). Thus we have the following theorem. 


Theorem 19.5 A design in ^t universally optimal over ^ t if 
it is balanced. 


Although balancedness is important, Kunert (1984) argues that for sufficiently 
large n one can find designs that are better than uniform balanced designs. At 
the same time he proved the following theorem. 


Theorem 19.6 If a uniform balanced design exists in then 


tr(C T (J*)) 

su PdeQ w [ tr ( C r ⑷)] 


-t 


I)' 


(19.13) 


where C T (d) is the information matrix for direct effects for design d. 


From (19.13) we see that even though a uniform balanced design may not be 
optimal it is nevertheless highly efficient for estimating direct effects for t > 2. 
More specifically, Kunert (1984) showed the following theorem. 

Theorem 19.7 A uniform balanced design d* with n = and p = t is 
universally optimal for direct effects in if 入 i = 1 and ^ > 3, or Ai = 2 

and t > 6. 
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For an extension of this result to t > 3 and 入 i < (，一 1)/2 see Hedayat and 
Yang (2003). ° 

The efficiency of uniform balanced designs forp = t is generally smaller for 
residual effects than it is for direct effects. Here then strongly balanced designs 
are important. 

Theorem 19.8 (Cheng and Wu, 1980) A strongly balanced design d* in 
^2t,n,p is universally optimal for the estimation of direct and residual effects over 

^t,n,p- 

Theorem 19.9 (Cheng and Wu, 1980) Let n = 入 J and p = ^ + 1 and let 
d* be a strongly balanced design in Q t ， n , p , which is uniform on the periods and 
uniform on the subjects in the first p — 1 = k:t periods. Then d* is universally 
optimal for the estimation of direct and residual effects over Qh 

Such designs include, of course, the extra-period crossover designs discussed 
in Section 19.5.4. 

For the proofs of these and other results on optimal designs we refer the reader 
to Cheng and Wu (1980), Kunert (1984), and Stufken (1996). 

19.7 ANALYSIS OF CROSSOVER DESIGNS 

In this section we shall discuss briefly how SAS PROC GLM can be used to 
analyze data from a crossover design using model (19.1). Since the designs in 
general are nonorthogonal, we need to use reduced normal equations to obtain 
estimates of direct and residual effects, as already alluded to in (19.9) and (19.10). 

We consider two examples, one a balanced design and the other a strongly 
balanced design, and use them to point out numerically some differences between 
them. 

Example 19.5 In Table 19.2 we present the data from a Williams square 
CO(4, 4, 4) (see Section 19.5.1) and the input for analyzing them according 
to model (19.1). We comment on some features of the input and output in 
Table 19.2: 

1. In the input we identify periods, subjects, treatments, and residuals by P, S, 
T, C, respectively; 

2. Since there is no residual effect in period 1 and since we cannot leave the 
corresponding place for C blank (as SAS would delete the entire data point), 
we introduce “level” 0 for that purpose; 

3. The general form of estimable functions (see L2) and the type III SS for peri¬ 
ods show that period effects are partially confounded with residual effects; 

4. Treatment and residual effects are nonorthogonal to each other as indicated 
by the fact that the type I and type III SS for treatments are not identical. 
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Table 19.2 Williams Square Crossover Design CO (4, 4, 4) 

options nodate pageno=l ； 
data Williams ； 
input P S T C Y @@； 
datalines ； 

1110 27.5 2141 20.1 3124 23.9 4132 17.2 

1220 29.9 2212 23.6 3231 20.2 4243 16.8 

1330 14.3 2323 15.7 3342 12.3 4314 17.2 

1440 19.3 2434 12.8 3413 20.0 4421 23.4 

/ 

run ； 

proc print data=Williams ； 
titlel 'TABLE 19.2'; 

title2 'WILLIAMS SQUARE CROSSOVER DESIGN'; 

title3 'CO(4, 4, 4)'; 

run ； 

proc glm data=Williams ； 

class P S T C ； 

model Y=P S T C/e solution ； 

title3 'ANALYSIS OF VARIANCE 1 ; 

run ； 


Obs 

P 

S 

T 

C 

Y 

1 

1 

1 

1 

0 

27.5 

2 

2 

1 

4 

1 

20.1 

3 

3 

1 

2 

4 

23.9 

4 

4 

1 

3 

2 

17.2 

5 

1 

2 

2 

0 

29.9 

6 

2 

2 

1 

2 

23.6 

7 

3 

2 

3 

1 

20.2 

8 

4 

2 

4 

3 

16.8 

9 

1 

3 

3 

0 

14.3 

10 

2 

3 

2 

3 

15.7 

11 

3 

3 

4 

2 

12.3 

12 

4 

3 

1 

4 

17.2 

13 

1 

4 

4 

0 

19.3 

14 

2 

4 

3 

4 

12.8 

15 

3 

4 

1 

3 

20.0 

16 

4 

4 

2 

1 

23.4 


The GLM Procedure 
Class Level Information 
Class Levels Values 


P 


4 


12 3 4 
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Table 19.2 (Continued) 

S 4 12 3 4 

T 4 12 3 4 

C 5 0 1 2 3 4 

Number of observations 16 
General Form of Estimable Functions 


Effect 


Coefficients 

Intercept 


LI 

P 

1 

L2 

P 

2 

L3 

P 

3 

L4 

P 

4 

L1-L2-L3-L4 

S 

1 

L6 

S 

2 

L7 

S 

3 

L8 

S 

4 

L1-L6-L7-L8 

T 

1 

L10 

T 

2 

Lll 

T 

3 

L12 

T 

4 

L1-L10-L11-L12 

C 

0 

L2 

C 

1 

L15 

C 

2 

L16 

C 

3 

L17 

C 

4 

L1-L2-L15-L16- 


Dependent Variable : Y 


Source 

DF 

Sum of 
Squares 

Mean Square 

F Value 

Pr > F 

Model 

12 

378.8035000 

31.5669583 

57.26 

0.0033 

Error 

3 

1.6540000 

0.5513333 



Corrected Total 

15 

380.4575000 




R-Square 


Coeff Var 

Root MSE 

Y Mean 



0.995653 


3.781124 


0.742518 


19.63750 
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Table 19.2 (Continued) 


Source 

DF 

Type I SS 


Mean Square 

F Value 

Pr > F 

P 

3 

53.8875000 


17.9625000 

32.58 

0.0086 

S 

3 

154.5075000 


51.5025000 

93.41 

0.0018 

T 

3 

149.8475000 


49.9491667 

90.60 

0.0019 

C 

3 

20.5610000 


6.8536667 

12.43 

0.0337 

Source 

DF 

Type III SS 


Mean Square 

F Value 

Pr > F 

P 

2 

2.2200000 


1.1100000 

2.01 

0.2790 

S 

3 

111.9660000 


37.3220000 

67.69 

0.0030 

T 

3 

161.8278182 


53.9426061 

97.84 

0.0017 

C 

3 

20.5610000 


6.8536667 

12.43 

0.0337 





Standard 



Parameter 


Estimate 


Error 

t Value 

Pr > |t| 

Intercept 


14.67500000 

B 

0.76085478 

19.29 

0.0003 

P 

1 

4.67000000 

B 

0.66412850 

7.03 

0.0059 

P 

2 

-0.60000000 

B 

0.52503968 

-1.14 

0.3361 

P 

3 

0.45000000 

B 

0.52503968 

0.86 

0.4544 

P 

4 

0.00000000 

B 




S 

1 

2.86000000 

B 

0.55066626 

5.19 

0.0139 

S 

2 

3.59500000 

B 

0.55066626 

6.53 

0.0073 

S 

3 

-3.45500000 

B 

0.55066626 

-6.27 

0.0082 

S 

4 

0.00000000 

B 




T 

1 

5.65000000 

B 

0.55066626 

10.26 

0.0020 

T 

2 

6.25500000 

B 

0.55066626 

11.36 

0.0015 

T 

3 

-1.28500000 

B 

0.55066626 

-2.33 

0.1018 

T 

4 

0.00000000 

B 




C 

0 

0.00000000 

B 




C 

1 

2.80000000 

B 

0.66412850 

4.22 

0.0244 

C 

2 

0.62000000 

B 

0.66412850 

0.93 

0.4194 

C 

3 

-1.14000000 

B 

0.66412850 

-1.72 

0.1846 

C 

4 

0.00000000 

B 





NOTE : The X'X matrix has been found to be singular, and a gen¬ 
eralized inverse was used to solve the normal equations. 
Terms whose estimates are followed by the letter are 

not uniquely estimable. 


5. Since we are considering a balanced design, we have vard the same 

for each u, u\u ^ u ’），with the same holding true for var(p M — ^p u '). A partial 
verification of this is that st(T u ) = st(j u — = .55 for u = l, 2, 3, and 

se(C M ) = se(p M — P 4 ) = .66 for w = 1, 2, 3. □ 

Example 19.6 In Table 19.3 we present the data for an extra-period design 
CO(4, 4, 5) (see Section 19.5.4) and the input for analyzing them. Again, we 
shall comment briefly on some features of the input and output in Table 19.3: 
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Table 19.3 Extra-Period Crossover Design CO (4, 4, 5) 


options nodate pageno=l ； 







data extra ； 
input P S T C Y @@； 
if C=0 then C=4; 
datalines ； 








1110 27.5 214 

1 20.1 

3 12 4 

23.9 

4 

13 2 

17.2 

5133 18.5 122 

0 29.9 

2 2 12 

23.6 

3 

2 3 1 

20.2 

4243 16.8 524 

4 17.1 

13 3 0 

14.3 

2 

3 2 3 

15.7 

3342 12.3 431 

4 17.2 

5 3 11 

18.0 

1 

4 4 0 

19.3 

2434 12.8 341 

3 20.0 

4 4 2 1 

23.4 

5 

4 2 2 

21.4 

/ 

run ； 








proc print data=extra ； 







titlel 'TABLE 19.3 '; 








title2 'EXTRA-PERIOD CROSSOVER 

DESIGN 1 ; 




title3 '00(4,4,5)'; 








run ； 








proc glm data=extra ； 
class P S T C ； 








model Y=P S T C/e solution ； 







lsmeans T/stderr ； 








title3 'ANALYSIS OF 

VARIANCE 1 ; 






run ； 








Obs 

P 

S 

T 

C 


Y 


1 

1 

1 

1 

4 


27.5 


2 

2 

1 

4 

1 


20.1 


3 

3 

1 

2 

4 


23.9 


4 

4 

1 

3 

2 


17.2 


5 

5 

1 

3 

3 


18.5 


6 

1 

2 

2 

4 


29.9 


7 

2 

2 

1 

2 


23.6 


8 

3 

2 

3 

1 


20.2 


9 

4 

2 

4 

3 


16.8 


10 

5 

2 

4 

4 


17.1 


11 

1 

3 

3 

4 


14.3 


12 

2 

3 

2 

3 


15.7 


13 

3 

3 

4 

2 


12.3 


14 

4 

3 

1 

4 


17.2 


15 

5 

3 

1 

1 


18.0 


16 

1 

4 

4 

4 


19.3 


17 

2 

4 

3 

4 


12.8 


18 

3 

4 

1 

3 


20.0 


19 

4 

4 

2 

1 


23.4 


20 

5 

4 

2 

2 


21.4 
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Table 19.3 (Continued) 


The GLM Procedure 
Class Level Information 
Class Levels Values 


P 

S 

T 

C 


5 1 2 3 4 5 

4 12 3 4 

4 12 3 4 

4 12 3 4 


Number of observations 20 
General Form of Estimable Functions 


Effect 


Coefficients 

Intercept 


LI 

P 

1 

L2 

P 

2 

L3 

P 

3 

L4 

P 

4 

L5 

P 

5 

L1-L2-L3-L4-L5 

S 

1 

L7 

S 

2 

L8 

S 

3 

L9 

S 

4 

L1-L7-L8-L9 

T 

1 

Lll 

T 

2 

L12 

T 

3 

L13 

T 

4 

L1-L11-L12-L13 

C 

1 

L15 

C 

2 

L16 

C 

3 

L17 

C 

4 

L1-L15-L16-L17 


Dependent Variable : Y 


Source 

DF 

Sum of 
Squares 

Mean Square 

F Value 

Pr > F 

Model 

13 

385.1805000 

29.6292692 

21.77 

0.0006 

Error 

6 

8.1675000 

1.3612500 




Corrected Total 19 393.3480000 
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Table 19.3 {Continued) 

R-Square Coeff Var Root MSE Y Mean 

0.979236 5.995510 1.166726 19.46000 


Source 

DF 

Type I SS 


Mean Square 

F Value 

Pr > F 

P 

4 

56.4080000 


14.1020000 

10.36 

0.0073 

S 

3 

119.2600000 


39.7533333 

29.20 

0.0006 

T 

3 

190.4300000 


63.4766667 

46.63 

0.0001 

C 

3 

19.0825000 


6.3608333 

4.67 

0.0518 

Source 

DF 

Type I 工工 SS 


Mean Square 

F Value 

Pr > F 

P 

4 

52.2875000 


13.0718750 

9.60 

0.0089 

S 

3 

167.4140000 


55.8046667 

41.00 

0.0002 

T 

3 

190.4300000 


63.4766667 

46.63 

0.0001 

C 

3 

19.0825000 


6.3608333 

4.67 

0.0518 





Standard 



Parameter 


Estimate 


Error 

t Value Pr > |t| 

Intercept 


14.20416667 

B 

1.02435193 

13.87 

<.0001 

P 

1 

4.88750000 

B 

0.96739825 

5.05 

0.0023 

P 

2 

-0.70000000 

B 

0.82500000 

-0.85 

0.4287 

P 

3 

0.35000000 

B 

0.82500000 

0.42 

0.6862 

P 

4 

-0.10000000 

B 

0.82500000 

-0.12 

0.9075 

P 

5 

0.00000000 

B 




S 

1 

3.45000000 

B 

0.75311852 

4.58 

0.0038 

S 

2 

3.42500000 

B 

0.75311852 

4.55 

0.0039 

S 

3 

-3.70833333 

B 

0.75311852 

-4.92 

0.0026 

S 

4 

0.00000000 

B 




T 

1 

5.56666667 

B 

0.75311852 

7.39 

0.0003 

T 

2 

6.42500000 

B 

0.75311852 

8.53 

0.0001 

T 

3 

-0.52500000 

B 

0.75311852 

-0.70 

0.5118 

T 

4 

0.00000000 

B 




C 

1 

2.67500000 

B 

0.82500000 

3.24 

0.0176 

C 

2 

0.87500000 

B 

0.82500000 

1.06 

0.3297 

C 

3 

-0.00000000 

B 

0.82500000 

-0.00 

1.0000 

C 

4 

0.00000000 

B 




NOTE: The 

X'X 

matrix has been 

found to be 

singular, and 

a gen- 


eralized inverse was used to solve the normal equations. 
Terms whose estimates are followed by the letter 'B' are 
not uniquely estimable. 
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Table 19.3 (Continued) 



Least 

Squares Means 




Standard 


T 

Y LSMEAN 

Error 

Pr > |t| 

1 

22.3375000 

0.5394137 

< . 0001 

2 

23.1958333 

0.5394137 

< . 0001 

3 

16.2458333 

0.5394137 

< . 0001 

4 

16.7708333 

0.5394137 

< . 0001 


1. The input statement ‘If C = 0 then C = 4’ is included for the sole purpose 
of obtaining least squares (LS) means (see Ratkowsky, Evans and Alldredge, 
1993) since otherwise LS means are “nonestimable” (see also (5) below); 

2. With the statement in (1) there is no longer any confounding between period 
and residual effects; 

3. Treatment and residual effects are now orthogonal to each other [independent 
of the statement in (1)] as indicated by the equality for the type I and type 
III treatment SS; 

4. The balancedness property is again indicated in the standard errors for treat¬ 
ments and residuals; 

5. The numerical value for the treatment LS means does depend on the statement 

in (1) ‘If C = 0 then C = u’ for u = 1, 2, 3, 4, but differences between LS 
means are independent of the value of u. □ 


19.8 COMMENTS ON OTHER MODELS 

All our discussions up to this point are based on model (19.1) with all effects, 
except e, assumed to be fixed effects. From many points of view — practical and 
theoretical — this seems to be the most plausible model, but in certain cases good 
arguments can be made for other models. We shall conclude this chapter by 
commenting briefly on some of these models without going into all the details. 

19.8.1 No Residual Effects 

The presence of residual effects not only affect the analysis of the data from a 
crossover design but also the interpretation of such an analysis. For example, 
we have mentioned earlier that the 2-treatment, 2-period, 2-sequence (AB, BA) 
design cannot be analyzed under model (19.1). Subject matter knowledge may 
suggest that no residual effects exist. Or appropriate wash-out periods can elim¬ 
inate residual effects, but they may lengthen considerably the duration of the 
experiment. In any event, the design problem becomes then one of Latin square- 
type designs (see 1.10). Obviously, we no longer need strongly balanced designs. 
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We should warn, however, that the use of a balanced design in itself does not 
automatically imply that residual effects can be dropped from model (19.1)，as 
encountered often in the applied literature. 

19.8.2 No Period Effects 

To overcome the problem mentioned above for the 2-treatment, 2-period, 2- 
sequence design, it may be quite reasonable to assume that there are no period 
effects. That may be true for other situations as well where the number of periods 
is relatively small. In this case the design reduces to a block design (with the 
subjects representing the blocks) but with restricted randomization so that the 
balancedness property can be preserved. 

19.8.3 Random Subject Effects 

In certain situations there may be good arguments that the subject effects are 
random rather than fixed effects. This is a question that cannot be answered 
generally but needs to be decided on a case-by-case basis. 

If the subject effects are considered to be random effects, then the analysis 
differs from that presented in Section 19.7 in that now in addition to intrasub¬ 
ject information also intersubject information with respect to direct and residual 
effects becomes available (this is, of course, akin to the interblock information 
for block designs as discussed in Section 1.7). Computationally this can be han¬ 
dled by using, for example, SAS PROC MIXED with the appropriate RANDOM 
statement. 

One other consequence of assuming random subject effects is an increase of 
the standard error for direct and residual least-squares means. 

19.8.4 Autocorrelation Error Structure 

It is plausible to argue that since we obtain p observations on the same subject 
the observations are correlated. In the context of model (19.1) this implies that for 
a given j the e" are correlated. One way to look at such a correlation structure 
is to consider the p observations for a subject a short time series and hence the 
entire set of observations as n time series. The simplest correlation structure in 
this context is induced by a first-order autoregression with 

etj = pet-ij + r}ij(i > 1) e\j = r]\j (19.14) 

and —1 < p < 1. The rjtj are assumed to be independently distributed with mean 
0 and 


var(w)= 


'a 2 /(l -p 2 ) 


for i = 1 
for i > 1 
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Then the variance-covariance structure for e is given by 

varO) = a 2 {I n x V) 

where V = (V///) and V/// = pi’ 一 〆 1/(1 — p 2 ) for ^ i f = 1,2 , ... ， p. 

The autocorrelation structure (19.14) leads to an information matrix that is 
a function of p, a generally unknown parameter [for an explicit expression for 
the joint information matrix for direct and residual effects see Matthews (1987)]. 
As a consequence it becomes much more difficult to obtain general results for 
optimality of designs. 

Even for ^ = 2 and p = 3, 4 the results are very complicated and not very 
practical (Matthews, 1987). The problem is compounded by the fact that we 
usually do not know the correct value of p in (19.14). A useful strategy then 
would be to choose a design that has good properties, that is, is efficient, over a 
reasonable range of p rather than choose the optimal design for a particular value 
of p. Along these lines Matthews (1987) has shown that for ， = 2 and p = 3 an 
equal number of subjects for the sequences ABB, BAA, AAB, BBA yields a 
highly efficient design for estimating direct effects and ABB, BAA, ABA, BAB 
for estimating residual effects. For t = 2 and /? = 4 the corresponding result is 
that the sequences ABBA, BAAB, AABB, BBAA yield a highly efficient design 
over the full range of p for estimating direct and residual effects. As Matthews 
(1987) pointed out, an interesting property of the design above for p = 4 is that 
if the experiment had to be stopped after three or two periods the resulting design 
is still highly efficient. 

We have seen earlier that balanced crossover designs play an important role. 
Because their information matrix C T in (19.11) is completely symmetric it fol¬ 
lows, of course, that comparisons between two direct effects are estimated with 
the same variance. Gill (1992) investigated this property for the case of autocor- 
related errors (remember that C T is now a function of p). He found that for p < ^ 
and t a prime or prime power the method proposed by Williams (1949) can be 
used to construct a balanced design with t(t — 1) subjects. Let L\, L 2 ,..., L t _i 
be the t — l mutually orthogonal Latin squares of size t in semistandard order; 
that is, the elements in the first row of Li, L 2 , ..., L t -\ are in natural order. 
Then adjoining L\, L 2 , … ， L t -\ yields a balanced CO (，， t(t — 1), t). 

For further results concerning Williams designs see Kunert (1985). 


19.8.5 Second-Order Residual Effects 

It may not be very common, but it is entirely possible that the effect of a treatment 
lingers on for two periods (and perhaps even more). In such a case we are not 
only dealing with first-order residual effects as we have discussed so far, but also 
with second-order residual effects. This changes model (19.1) to 

yij = /^ + 兀 / + ^/ + ^d(i,j) Yd(i—2,j) ^ij (19.15) 
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where Yd(i-2,j) indicates the second-order residual effect in period i for the 
treatment that the design d assigned to subject j in period i — 2. 

Consideration of second-order residual effects obviously further complicates 
the construction of efficient crossover designs. Williams (1950) proposed a 
method based on mutually orthogonal Latin squares with the property that each 
treatment is preceded by each ordered pair of different treatments (see 1.10.7.3). 
For these designs and under model (19.15) the direct effects are nonorthogonal 
to their first-order and second-order residual effects. 

An orthogonal CO(2, 4, 6) was proposed by Quenouille (1953) (see also John 
and Quenouille, 1977). His idea was extended for ^ > 2 by Sharma (1977). His 
method produces what he refers to as completely balanced designs. 

Definition 19.8 A design d in ^lt,n,p is called a completely balanced design 
if (1) d is uniform, (2) d is strongly balanced for first-order residual effects, 
(3) each treatment is preceded equally often by every other treatment (including 
itself) by two periods, and (4) each ordered pair of treatments occurs equally 
often in the last two periods. □ 

It is easy to see that a minimal completely balanced design is a CO(?, t 2 , 3t). 
Sharma (1977) introduced a method of constructing such designs through the use 
of orthogonal arrays of the form OA[t 2 , 3, t, 2; 1]: Such an orthogonal array can 
be obtained by adjoining t arrays of the form 

"1 2 ... t ' 

u u ••- u (u = 1,2,... ,t) (19.16) 

_U U ~\~ \ ... II ~\~ t — 1 _ 

The CO(?, t 2 , 3t) is obtained from developing the full array (19.16) by adding 
successively 1, 2,— 1 to its three rows, always mod t. 

We illustrate this procedure for the probably most useful cases t = 2 and t = 3 
in the following examples. 

Example 19.7 For t = 2 the array (19.16) (periods 1, 2, 3) and the final 
CO(2, 4, 6) are obtained as 
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Example 19.8 For t = 3 we obtain the following CO(3, 9, 9), with the first 
three rows constituting array (19.16): 


Subject 


Period 



1 

2 

3 

4 

5 

6 

7 

8 

9 

1 

1 

2 

3 

1 

2 

3 

1 

2 

3 

2 

1 

1 

1 

2 

2 

2 

3 

3 

3 

3 

1 

2 

3 

2 

3 

1 

3 

1 

2 

4 

2 

3 

1 

2 

3 

1 

2 

3 

1 

5 

2 

2 

2 

3 

3 

3 

1 

1 

1 

6 

2 

3 

1 

3 

1 

2 

1 

2 

3 

7 

3 

1 

2 

3 

1 

2 

3 

1 

2 

8 

3 

3 

3 

1 

1 

1 

2 

2 

2 

9 

3 

1 

2 

1 

2 

3 

2 

3 

1 


□ 


19.8.6 Self and Mixed Carryover Effects 

One important characteristic of strongly balanced designs is the fact that each 
treatment is preceded by itself 入 2 times. This means that some observations 
are a function of the direct and residual effects of the same treatment, whereas 
other observations are influenced by direct and residual effects from different 
treatments. Model (19.1) does not distinguish between these two situations in that 
p u is the same regardless of whether it occurs together with x u or x u <(u ^ u’）. 
This has sometimes been considered as a weakness of the model. One can argue 
that there might be some form of interaction between direct and residual effects 
of treatments. This was, indeed, considered by Sen and Mukerjee (1987), but the 
ensuing model becomes too complicated and not practically useful. 

A different approach to this problem, which can be considered as a special 
form of direct-residual treatment interaction, was first proposed by Afsarine- 
jad and Hedayat (2002) and further considered by Kunert and Stufken (2002). 
Following their development, we consider two types of carryover effects: self 
carryover effect, which occurs when treatment u is followed by itself, and mixed 
carryover effect, which occurs if treatment u is followed by treatment u' (u ^ u f ), 
independent of what u' is. This distinction then leads to the following model 
[using the same notation as for model (19.1)]: 


yij = /^ + TT/ + 〜• + Tj ( 心 •）+ 


Pd(i-hj) 

Xd(i-l,j) 


if d(i, j) ^ d(i - 1, j)) 

if d(ij) = d(i-lj)\^ eij 
(19.17) 


Obviously, Pd{i-\,j) refers to the mixed and Xd(i-i,j) t0 the self carryover effect. 
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Based on model (19.15) Afsarinejad and Hedayat (2002) provide useful designs 
for ? > 2 and p = 2, and Kunert and Stufken (2002) discuss the optimality of 
designs for ? > 3 and p > 2. We shall report some of their findings, but refer to 
the original articles for details. 

1. p = 2 The CO(^, n, 2) consists of two parts, say D\ and D 2 , where in 
D\ n\ subjects receive the same treatment in both periods and in D 2 ^2 subjects 
receive different treatments in periods 1 and 2, with n\ n 2 = n. It is obvious 
that we can estimate differences between the Xu from D\, and hence we would 
require n\ > t. Of critical importance then becomes the construction of D 2 . 

One method to construct a CO(^, n 2 , 2) for estimating direct effects is to 
use each ordered pairs of treatments for ri 2 = t{t — 1) subjects. This may lead 
to a rather large design. An alternative method is derived by Afsarinejad and 
Hedayat (2002) through the use of BIB designs. It can be described as follows: 
Consider a symmetric BIB design with t blocks of size k. Write the design in 
the form of a A: x ? Youden square, H say. Add to // a new row to obtain 
a (k l) x t array, H* say, that maintains the Youden square property. Then 
assign k subjects to each treatment for the first period in D 2 . Consider, for 
example, treatment u(u = 1,2,..., t). Search for treatment u in the first row 
of H*, and then assign successively those treatments to the second period that 
appear in the corresponding column of H. The resulting design D 2 is an optimal 
design in the subclass of CO(^, n ， 2 , 2). 

To illustrate this method we consider the following example. 


Example 19.9 For t = 1 and 众 = 3 we have the symmetric BIB design 


H: 


1 2 3 4 5 6 7 

2 3 4 5 6 7 1 

4 5 6 7 1 2 3 


We obtain 

3 4 5 6 7 1 2 

, 1 2 3 4 5 6 7 

. A • 2 3 4 5 6 7 1 

4 5 6 7 1 2 3 


Then D 2 is given by the 21 x 2 array 


D 2 ： 


3334445556667771 1 1 2 2 2 

124235346457561672713 


and the entire design uses + ^2 = 7 + 21 = 28 subjects. 


□ 
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2. p > 2 The results of Kunert and Stufken (2002) are closely tied to the 
notions of balanced block designs and generalized Youden designs (see Kiefer, 
1958, 1975a, 1975b). 。 ^ 

Definition 19.9 A block design with t treatments and blocks of size k is 
called a balanced block design if (1) each treatment is replicated equally often, 
(2) each pair of treatments occurs together the same number of times, and (3) 
\riij — k/t\ < 1 for all /, j (i = 1,2,... ,t; y = 1 , 2,... , b), where ""is the 
number of times treatment i appears in block j (see Section 1.3.1). □ 

Definition 19.10 A row-column design is called a generalized Youden 
design if it is balanced block design when considering rows as blocks and 
columns as blocks. □ 

Investigating the properties of the information matrix C T under model (19.15) 
with regard to uniform optimality Kunert and Stufken (2002) introduced the 
following definition. 

Definition 19.11 A design d in ^ t ,n,p is called totally balanced if the fol¬ 
lowing conditions hold: 

1. J is a generalized Youden design; 

2. J is a balanced block design in the carryover effects; 

3. d is balanced for carryover effects; 

4. The number of subjects where both treatment u and u f appear [p/t] + 1 

times and treatment u f does not appear in the last period is the same for 
every pair u ♦ u’. □ 

To establish universal optimality we then have the following theorem. 

Theorem 19.10 (Kunert and Stufken, 2002) For t >3 and 3 < p < 2t, if a 
totally balanced design d* in exists, then J* is universally optimal for the 

estimation of direct effects over ^l t , n , p . 

No general methods of constructing such designs exist, but they can sometimes 
be obtained by modifying existing designs considered in earlier sections. We 
illustrate this with the following example. 

Example 19.10 For ? = 3, p = 4, and n = 6 the extra-period design 
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is universally optimal under model (19.1). Now d is strongly balanced for residual 
effects, rather than balanced. Hence d is not totally balanced. Changing the 
arrangement in the last period leads to the following totally balanced design: 

"1 2 3 1 2 3' 

*_ 3 1 2 2 3 1 

^ = 2 3 1 3 1 2 

_1 2 3 1 2 3_ 

Thus is universally optimal under model (19.15). □ 

Kunert and Stufken (2002) point out the interesting fact that in optimal designs 
under model (19.15) generally no self carryover effects occur. This shows that 
optimality may not always be the ultimate criterion because in some applications 
it may be useful to obtain estimates of self carryover effects, for example, in 
connection with a single drug therapy for a chronic disease (Afsarinejad and 
Hedayat, 2002). 

19.8.7 Factorial Treatment Structure 

So far we have tacitly assumed that we are dealing with t qualitative treatments. 
It is, however, entirely possible that we could have quantitative treatments. For 
example, our treatments may be represented by one or more factors at several 
levels. This fact may be used to construct special designs that achieve certain 
statistically desirable properties, such as certain types of orthogonality. Such 
designs were discussed by for example, Berenblut (1967, 1968) and Mason and 
Hinkelmann (1971). 

Here we shall discuss briefly yet another type of treatment structure, namely a 
factorial treatment structure. We shall limit ourselves to the case of 2 m factorials, 
although the approach can be extended easily to other factorials. An obvious 
way of proceeding is to simply equate the 2 m treatment combinations to the t 
treatments and use the designs we have discussed earlier. This will lead, however, 
very quickly to a prohibitively large number of periods. To reduce the number of 
periods we shall make the assumption that some interactions are negligible and 
then use appropriate systems of confounding. 

In order to use this approach it is useful to use in model (19.1) the reparam¬ 
eterization (7.42) and rewrite the model as 

yij = M + TT/ + ~ + + Cl r (zd(i — 1,7)) ^ij 

where Xd(i,j) represents the treatment combination x assigned by d to subject 
j in period i, and Zd{i-\,j) the treatment combination z assigned to subject j 
in period / — 1. Further, a(xd(ij)) and a r (Zd(i-\,j)) represent the true direct 
and residual effects in terms of main effect and interaction effect components 
K'x = A< 2 " ' Aa m)ct'x and E a r a , x = (A^ 1 A a 2 2 •■- A^) r ^ x , respectively. 
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implying that for subject 1 we have the association (0, 0, 0 ) 三 1, (1, 0, 1) = 2, 
(0, 1 ， 1) 三 3, （ 1, 1, 0) 三 4, with a similar association for subject 5. 

From the design in Table 19.4 we can estimate A, B, AB, C, AC, BC and 
A r , B r , (AB) r , C r , (AC) r , (BC) r . We point out, however, that the effects like 


Table 19.4 CO(2 3 , 8, 4) with ABC Confounded 

Subjects 

Periods 1 2 3 4 5 6 7 8 


(0,0,0) 

(1,0,1) 

(0,1,1) 

(U,0) 

(1,1,1) 

(0,1,0) 

(1,0,0) 

(0,0,1) 

(1,1,0) 

(0,0,0) 

(1,0,1) 

(0,1,1) 

(0,0,1) 

(1,1,1) 

(0,1,0) 

(1,0,0) 

(1,0,1) 

(0,1,1) 

(1,1,0) 

(0,0,0) 

(0,1,0) 

(1,0,0) 

(0,0,1) 

(1,1,1) 

(0,1,1) 

(1,1,0) 

(0,0,0) 

(1,0,1) 

(1,0,0) 

(0,0,1) 

(1,1,1) 

(0,1,0) 


The method to construct suitable designs is to first construct the block design 
confounding selected interactions and their generalized interactions with blocks. 
These blocks constitute the generating subjects. Each generating subject is then 
developed into a Williams square as described in Section 19.5.1. Formally we 
can describe the result as in the following theorem. 

Theorem 19.11 (Shing and Hinkelmann, 1992) For a 2 m factorial with i = 
2 q ~ l effects E 爲 1 , E ^ 2 ,... ， negligible (this set is made up of q independent 
interactions and their generalized interactions), where m > q l, there exists a 
CO(2 m , 2 m , 2 m ~ q ) such that all nonnegligible direct and residual effects and 
interactions are estimable. 

To construct such a design we assign 2 m _ q treatment combinations to each of 
2 q subjects in arbitrary order by confounding E^ 1 , E ^ 2 ,..., with subjects. 
Each of the 2 q initial subjects is then developed into a Williams square. We 
adjoin the 2 q Williams squares to obtain the final CO(2 m , 2 m , 2 m ~ q ). 

We illustrate this procedure in the following example. 

Example 19.11 Suppose we have a 2 3 factorial with factors A, B, C. Sup¬ 
pose further that we want to use p = 4 = 2 2 periods, that is, we have m = 3, 
q = l. Thus we need to confound one interaction with subjects, say ABC. The 
final design is then given in Table 19.4, with subjects 1 and 5 being the initial 
subjects, and using the fact that the Williams square, that is, CO(4, 4, 4) is of 
the form 


4 3 12 
3 2 4 1 
2 13 4 
14 2 3 
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A r have no particular meaning, except that they represent linear combinations of 
residual effects. □ 

Obviously, for large values of m this method may lead to rather large designs. 
In that case we may need to use fractional factorials together with a system 
of confounding. More specifically, if we use a 2 m _ s fraction and 2 m ~ s ~ q peri¬ 
ods, then we can construct a CO(2 m_5 , 2 m ~ s , 2 m ~ s ~ q ) by the method described 
above. However, this must be considered carefully because this may not lead to 
practically useful designs for all values of m, s, and q. 

For another approach to constructing crossover designs with a factorial treat¬ 
ment structure we refer to Fletcher, Lewis, and Matthews (1990). They consider 
in particular the case of two factors with up to four levels in three or four periods. 
Their designs are obtained by cyclic generation from one or more suitably chosen 
initial sequences. A table of such sequences is provided by Fletcher, Lewis, and 
Matthews (1990). 
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Fields and Galois Fields 


The notion of a finite field, in particular a Galois field, plays an important role in 
the construction of experimental designs (see, e.g., Chapters 3 and 13). We shall 
give here the elementary notions of fields and Galois fields. For a more thorough 
discussion of this topic the reader is referred to, for example, Carmichael (1956). 

Consider a set of ^ distinct elements (marks) U = {wo, u\,..., u s -\}. 

Definition A.l The set IX is said to be a finite field of order s if its elements 
satisfy the following conditions: 

1. The elements may be combined by addition with the laws 

Ui Uj = Uj -\- Ui 
H-i (w j W 灸 ) = (M/ + My) ~h Uj^ 

2. The sum of any two elements is an element in U. 

3. Given two elements U[ and Uk ，there is a unique element Uj such that 
m + Uj = Uk. Then Uj is said to be determined by subtraction, and we 
write uj = Uk — U(. 

4. The element having the additive property of zero is uq so that 

uq + uj = uj or uq = uj — u j 

for all j. 

5. The elements may be combined by multiplication with the laws 

UiUj = UjUi 
Ui {u jU]^) = {Ui U 
Ui(Uj + Uk) = U[U j + UiUk 
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Table A.l Primitive Marks of 
Finite Fields of Order p 


Order of 
Field (p) 

Primitive 
Mark (u) 

3 

2 

5 

2 

7 

3 

11 

2 

13 

2 

17 

3 

19 

2 

23 

5 


6. The product of any two elements is an element in U. 

7. From the relations 

UQUi = UiUO = Ui(U j — U j) = UiU j — U[U j = Uo 

it follows that wo has the multiplicative properties of zero. 

8. Given any Ui(^ uq) and any there is a unique uj such that 

UiU j — Uk 

Then uj is said to be determined by division, and uj is called the quotient 
of u/c by Ui. 

9. The quotient w //ui has the multiplicative properties of unity and is chosen 

to be Ui. □ 

The finite field of s = p elements, where p is a prime number, may be 
represented by uo = 0, u\ = 1, M 2 = 2,..., u p -\ = p — l. Addition and multi¬ 
plication are the ordinary arithmetic operations, except that the resulting number 
is reduced mod p, that is, the resulting number is replaced by the remainder after 
dividing by p. 

Every element u G U(u ^ uq) satisfies the equation u p_1 = 1. If for a given 
u, p — 1 is the smallest power satisfying this equation, then u is called a primitive 
mark of the field. Then the marks u, u 2 , w 3 ,..., u p ~ l are all distinct and hence 
represent, in some order, the elements of U. Examples of primitive marks (roots) 
for different values of p are given in Table A.l. As an illustration consider the 
following example. 

Example A.l Consider p = 7, u = 3, u l = 3, w 2 = 3 2 = 2, w 3 = 2 • 3 = 
6， = 6.3 = 4, / = 4.3 = 5， = 5 • 3 = 1. D 


The field of order p described above is a special case of a finite field of order 
s = p n , where p is a prime number. Such a field is referred to as a Galois field 
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and is generally denoted by GF(〆）. For n = l the elements are the residues 
modulo p as described above. For /i > 1 the elements are obtained as follows. 

Let P(x) be a given polynomial in x of degree n with integral coefficients 
and let F(x) be any polynomial in x with integral coefficients. Then F(x) may 
be expressed as 

F(x) = f(x) + p - q(x) + P(x) 20) 

where 

f(x) = Z?o + b\x + b 2 X 2 + ... + b n -\ x n ~ l 


and coefficients bi (i = 0, 1,n — l) belonging to GF(p), that is, the set It = 
(0, 1, 2,..., p — 1). This relationship may be written as 

F(x) = f(x) mod p, P(x) (A.l) 

and we say that f(x) is the residue modulis p and P(x). The functions F(x), 
which satisfy (A.l) when f(x), p, and P(x) are kept fixed, form a class. If p 
and P(x) are kept fixed, but f(x) is varied, p n classes may be formed since 
each coefficient in f(x) may take the p values 0, 1, 2,..., p — 1. 

It may be readily verified that the classes defined by the f(xYs make up a 
field. For example, if Fi (x) belongs to the class corresponding to f[ (x), and Fj (x) 
corresponding to then Fi(x) + Fj(x) belongs to the class corresponding to 

fi(x) + fj(x). The other operations defined for a field are, likewise, satisfied, any 
function obtained by ordinary algebraic operations being replaced by its residue 
modulis p and P(x). In order that division be unique, it is also necessary that p 
be a prime number and that P(x) cannot be expressed in the form 

P(x) = P { (x) P 2 (x) + p P 3 (x) (A.2) 

where P/ (x) are polynomials in x with integral coefficients, and the degrees of 
Pi(jc) and P 2 (x) being positive and less than the degree of P(x). Any P(x) that 
does not admit the representation is thus irreducible mod p. 

Summarizing the preceding discussion, we have the following definition. 

Definition A.2 For p a prime number and P(x) an irreducible polynomial 
mod p, the residue classes given by (A.l) form a finite field of order p n . This 
field is called a Galois field, and is denoted by GF(p^). □ 

As an illustration we consider the following example. 

Example A.2 Let p = 3 and consider GF(3 2 ). Then P(x) is of degree 2, 
that is, of the form a 。 + a\x + with ao, a\,a 2 G {0, 1, 2}. For P(x) to be 
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irreducible mod 3 we can exclude ao = 0. Suppose we take ao = l. Then P(x) is 
of the form 1 + a\x + a^x 1 . If ai = 0, 幻 = 2 we have P(x) = 1 + 2x 2 , which is 
0 for x = 1 and can be written as (1 + x)(l + 2x) — 3x, that is, it is of the form 
(A.2), and hence is not irreducible mod 3. We then consider a\ = 0, 句 =1, 
that is, P(x) = 1 + x 2 . For x = 0, 1, 2, P(x) takes the values 1, 2, 2 mod 3, 
respectively, and hence is irreducible mod 3. For P(x) = l x 2 we then consider 
the possible /(x)’s, which are of the form bo + b\x. Hence, the elements of 
the GF(3^) are: uq = 0, u\ = 1, u\ = 2, = x, u .4 = 2x, M 5 = 1 + x, = 2 + 

x, M 7 = 1 + 2x, Ms = 2 + 2x. Just as for the GF(p), there exists as primitive mark 
u such that the powers (j = 1,2 ,..., p n — l) represent the nonzero elements 
of GF(p n ). In the above case, u = l x is such a primitive mark, since 

w 1 = 1 + x = M5 

= \ -\r 2x -\r = 2x = U 4 

= 2 x (1 + x) = 2 x -j- 2x 2 = 2x + 1 + 2(1 + x^) = 1 + 2x = W7 
w 4 = (1 + 2x)(1 + x) = l + 3x + 2x 2 = 2 + 2(1 + x^) = 2 = U 2 
it' = 2(1 + x) = 2 -j- 2 x = ug 


= (2+ 2x)(1 + x) = 2 + 4x + 2x 2 = 4x = x = U 2 
u 1 = x(l +x) = x+x 2 = 2 + x + (1+ x 1 ) = 2 + x = W6 

= (2 + x)(l + x) = 2 + 3x + x 2 = 1 = wi □ 


The irreducible polynomials P(x) are not unique. It can be shown, however, 
that there always exists a particular irreducible P(x) such that u = x is 3 . primitive 
element. Such a P(x) is called a primitive polynomial. A partial list of primitive 
polynomials is given in Table A.2 [for a more extensive list see Hedayat, Sloane, 
and Stufken (1999)]. 

To perform arithmetic operations in a GF it is convenient to first obtain addition 
and multiplication tables in terms of the powers of the primitive element, x, using 
the primitive polynomial (see Table A.2). We illustrate this with the following 
example. 


Example A.3 Consider the GF(2 2 ). The elements are wo = 0, mi = 1, U 2 = 
X ， W 3 = 1 + x = x 2 , with P(x) = 1 + x + x 2 . We then have the following addi¬ 
tion table: 



MO 

U\ 

U 2 

W 3 

wo 

uo 

U\ 

U2 

U3 

U\ 

U\ 

wo 

M 3 

U 2 

U2 

U2 

W 3 

MO 

U\ 

U3 

M3 

U2 

U\ 

MO 
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Table A.2 List of Primitive 
Polynomials for Selected Values 
of p and n 


GF 


P(x) 

2 2 

1 + 

X x 2 

2 3 

1 + 

X + 

2 4 

1 + 

X x 4 

2 5 

1 + 

x 2 -\-x 5 

3 2 

2 + 

X + x 2 

3 3 

1 + 2x + x 3 

5 2 

2 + 

X + x 2 



0 

x° 

x 1 

x 2 

0 

0 

x° 

X 1 

X 2 

x° 


0 

X 2 

X 1 

x 1 

x 1 

x 2 

0 

x° 

X 2 

X 2 

X 1 

x° 

0 


For example, 

x 1 + x 2 = x(l + x) = x 3 = 1 = x° 


Similarly, we obtain the following multiplication table: 


or 


For example, 



wo 

U\ 

U2 

M3 

Wo 

M 0 

MO 

Mo 

Wo 

U\ 

Wo 

U\ 

U2 

M 3 

U2 

M 0 

U2 

W 3 

U\ 

U3 

MO 

M3 

U\ 

U2 



0 


X 1 

X 2 

0 

0 

0 

0 

0 

x° 

0 

x° 

X 1 


X 1 

0 

X 1 

X 2 


x 2 

0 

x 2 

x° 

x 1 


2 2 4 3 1 

X - X = X = X •x = x= x 


With these tables we can then carry out all arithmetic operations in GF(2 2 ). □ 
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Finite Geometries 


A finite geometry consists of a finite set of elements, called points, that are 
subject to certain conditions about points and lines. Such a geometry can be 
represented in terms of the marks of a Galois field, GF(p n ), where p is a prime. 
Of primary importance in the context of constructing experimental designs are 
K -dimensional projective and Euclidean geometries, denoted by PG ( 尤， p n ) and 
EG([ ， p n ), respectively. 


PROJECTIVE GEOMETRY PG ([’ P N ) 


A point of a K -dimensional projective geometry is defined by the ordered set 
of homogeneous coordinates (/xo, /xi,..., /xk) where /x/ G GF(p n ) and at least 
one /x/ # mo = 0. The symbol (/x/zo, Wi, ... ， /x/ik) denotes the same point, 
where \i G GF(p w ) and \x ^ uq. The total number of points in the PG([ ， p n ) 
then is 


p n(K+l) _ X 
~ p n - 1 ~~ 


=i + p n + p 2n + ■ ■ ■ + p Kn 


(B.l) 


since there exist _ \ ordered sets ^ (0 ， 0 ， .. . ，0 ), and these may be 

arranged in groups of p n — l sets all of whose elements represent the same 
point. 

Now consider two distinct points (/zo, ../x^) and (vo，.. . ， ^k)- Then the 
line between these points is defined by the points 


(MMo + ^0, MMl +vvi,..., [i\x K + vv K ) (B.2) 


Design and Analysis of Experiments. Volume 2: Advanced Experimental Design 

By Klaus Hinkelmann and Oscar Kempthorne 

ISBN 0-471-55177-5 Copyright © 2005 John Wiley & Sons, Inc. 


721 



722 


APPENDIX B 


where /x, y G GF(p n ), not both 0. The number of possible combinations of /x 
and v is p 2n — 1, so that because of proportionality the line (B.2) contains 


P 


2n 


P 


n—l 


P n ~^l 


(B.3) 


points. 

The system of points and lines given by (B.l) and (B.3) is called a K- 
dimensional projective geometry PG(7^ ， p n ). 

In (B.2) we have defined what we might call a 1-space in the PG([ ， p n ). 
More generally, we can define an M-space (M < K) in PG(7^, p n ) as follows: 
If the points (/x/o, M/i, .. ” imk) (i = 0 ， 1 ， . • ” M) are any M + 1 points not in 
the same (M — 1)-space, then an M-space is defined by the points 


(E 

\i=0 


M/0, 


〉: M/ 1 5 • • • 5 〉: 


IMfMK 


(B.4) 


where /xo, Ml ， …， Mm ^ GF(p n ) and not all equal to 0. Alternatively, (B.4) can 
be described in terms of the solutions to K — M independent, linear, homoge¬ 
neous equations 


a/oMo + anMi H - h ^iK^K = 0 


(B.5) 


(i = l, 2, ..K — M) with a" fixed. 

In order to find the number of M-spaces in the 欠 -space (M < K), we need 
to find first the number of ways in which the M + 1 noncollinear base points can 
be chosen, such that they do not all lie in any (M — 1)-space: 


1 st point: 

\+p' i + -- 

■. + p Kn ^ N Kl 

2 nd point: 

p n + - 

■■+ p Kn E 

三 Nk2 

3rd point: 

p 2n + - . 

.+ p Kn E Nk3 

(M + l)th point: 

P Mn + - 

■ . + p Kn E 

三 Nk ， m+\ 


and then the ways in which M + 1 points of a given PG(M, p n ) may be selected 
in a given order so that they do not all lie in any (M — l)-space: 


1 st point 

1 + 垆 + • ■ 

. + p Mn 三 

三 

2 nd point 


•. + p Mn 三 

三 Nm2 

(M + l)th point 


^.Mn - 

P = 

三 Nm,m+i 
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Hence every PG([ ， p n ) contains 

M+l 

n N Ki 

K, p n ) = ^^ (B.6) 

n Nmi 

i=\ 

M-spaces, with Nka and Nmi as defined above. 

Similarly, we can show that every PG(5, p n ) in PG(A^ p n ) is contained in 

. „ (p (S+1)n + ... + P Kn ) X ... x (p Mn + ... + p Kn ) 

炉， ，P) = Jys+m + ... + pMn^ x ... x + p Mn) p Mn 

(B.7) 

PG(M, p n Ys for S < M. In particular, for 5 = 0, every point is contained in p(0, 
M, K, p n ) PG(M, p n ) in PG(K, p n ), and, for 5=1, every line is contained in 
(p(l, M, K, p n ) PG(M, p n ) in PG([ ， p n ), or, since with every pair of points 
also the line containing them is in the PG(M, p n ), it follows that every pair of 
points is contained in M, K, p n ) different PG(M, p n ). 

EUCLIDEAN GEOMETRY EGd P N ) 

Consider a PG ( 尤， p n ) and the subset of points (/zo, [i\, .. • ， [Ik) for which 
fiQ ^ 0. Take /xo = 1, then the points (1, \i\, .. . ， /xk) constitute an EG(K, p n ) 
consisting of p Kn points. The excluded points (0, /i\, .. /ik) form a PG([ — 1 ， 
p n ) with homogeneous coordinates (/ zi, /X 2 , .. i^k)- Hence, the EG(^T, p n ) is 
obtained by deleting from the PG([ ， p n ) a PG ( 欠 一 1 ， p n ). 

Every PG(M, p n ) contained in PG(^T, p n ) but not in PG(^T — 1, p n ) becomes 
an EG(M, p n ) since by deleting PG([ — 1 ， p n ) from PGd 〆）we also delete 
PG(M — 1, p n ) from PG(M, p n ). The number of EG(M, p n ) contained in EG(K, 
p n ) then is K, p n )—\j/{M, K — 1, p n ), with xj/ defined in (B.6). The number 
of EG(M, p n ) containing a given EG(5, p n ) with 5 < M is cp(S, M ， K, p n ) as 
defined in (B.7). 
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Orthogonal and Balanced Arrays 


ORTHOGONAL ARRAYS 

Definition C.l A K x N matrix A, with entries from a set of ^ > 2 elements, 
is called an orthogonal array of strength t, size N, K constraints and s levels if 
each t x N submatrix of A contains all possible t x l column vectors with the 
same frequency A., denoted by OA[N, K, s, t\ 入 ]， where 入 is called the index of 
A and N = Xs l . □ 


For the case E = {0, 1,... ， s — 1} consider the following example. 


Example C.l OA[9, 4, 3, 2; 1]: 








/ o 

0 

0 

1 

1 

1 

2 

2 

2 \ 

A = 

0 

1 

2 

0 

1 

2 

0 

1 

2 

0 

1 

2 

1 

2 

0 

2 

0 

1 


V o 

1 

2 

2 

0 

1 

1 

2 

0 / 


□ 


There are different methods of constructing orthogonal arrays [for a thorough 
discussion see Hedayat, Sloane, and Stufken (1999)]. We shall present here a 
particular method based on projective geometries. 

Theorem C.l If we can find a matrix C = (cij) of K rows and r columns, 
whose elements Cij belong to a GF(p n ), and for which every submatrix obtained 
by taking t rows is of rank t, then we can construct an OA[p nr , K, p n , t; 入 ]. 

Proof Consider r x 1 column vectors § whose coordinates belong to GF(p n ). 
Then there are p rn different Consider then the matrix A whose p Tn columns 
are the vectors C ^ (i = 1, 2, ..p nx ). A is then an orthogonal array: 
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Suppose A* is a ? x p rn submatrix of A, and C* is the corresponding t x r 
submatrix of C. Consider 

A* = CH’...，U 

with s = p Tn , and denote the ith column of A* by a/ (/ = 1,2, .. s), that is, 

ca = C*^ 
or 

c\\^n + c\2^i2 + ... + cu^i T = an 

C21^n + C22^i2 + … + C2t^it = «/2 

Ctl^il + C t 2^i2 + . • • + Ctx^ix = Oiit 

Assume without loss of generality that the first t columns of C* are linearly 

independent. Then consider all the ^ that have the same last r — t components. 
There are p tn different such vectors. Each will give rise to a different a“ But 
the total number of different cti is p tn , that is, each possible a, is obtained 
exactly once from such a set. Now there are p n ( T_t ) possible sets of vectors for 
which this holds. Hence each different a/ is obtained exactly 产 r_ ’ ） times. This 
implies that in A* each possible t x l column vector occurs X = p n ^ x ~^ times 
and shows that A is an orthogonal array of strength t and index 入 . □ 

The rows of C in Theorem C.l may be interpreted as the coordinates of a 
PG(r — 1, p n ) such that no t points lie in a space of dimension < t — 2. To 
find a matrix C then means to find K points of a PG(r — 1, p n ) with the above 
restriction. We illustrate this with the following example. 

Example C.2 Consider the case p n = 2. In the PG(r — 1,2) consider all 
points that are not in the (t — 2 )-space: 

xq x i + • • • + x x —\ = 0 

There are exactly 2 T_1 such points, namely those with an odd number of unity 
components. No three of these points are on the same line, since in the PG(r — 1, 
2) each line has exactly three points, one of which is excluded. Hence we can 
construct an OA[2 T , 2 T_ 1 ,2, 3; X]. 

Consider, for example, the case r = 3: 
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/ 0 1 0 1 0 1 0 1 \ 

( n . •，夸 8 )= 0 0 1 1 0 0 1 1 

V o o o o i i i i / 

and hence 

/ 0 1 0 1 0 1 0 1 \ 

0 0 1 1 0 0 1 1 
= 0 0 0 0 1 1 1 1 

V 0 1 1 0 1 0 0 1 / 

with 人 =1. It is easy to see that A is also an OA[2 r , 2 r_1 , 2, 2; 2]. □ 


BALANCED ARRAYS 

Balanced arrays, or B-arrays, were introduced by Chakravarti (1956) under the 
name partially balanced arrays. The following definition is adapted from Srivas- 
tava and Chopra (1973). 

Definition C.2 A K x N matrix B, with entries from a set ^ of ^ > 2 
elements, is said to be a balanced array of strength t, size N ， K constraints, and 
s levels if for each t x N submatrix Bq of B and for each ， x 1 vector x = (x\, 
X 2 , .. x t ) f with elements from we have 

入 ( 文，及 0 )= 入 ( 户 ( 幻，如） （ C.l) 

where 入 (a:, Bq) is the number of columns of that are identical with x, and 
P(x) is any vector obtained by permuting the elements of x. 

We denote such an array by BA[A^, K, s, t; X r ], where \ r is the vector of the 
totality of X(x, Bo) as defined by Definition C.l. The term \ r is referred to as the 
index set of the B-array. Note that if all elements of A/ are identical, say equal 
to X, then the B-array becomes an orthogonal array with index X. □ 

For the case ^ = {0, 1, 2, … ，s — 1} we illustrate the notion of a B-array 
with the following example. 

Example C.3 Consider BA[15, 5, 3, 2; 1, 2] 

/ 0 0 0 0 0 1 1 1 1 1 2 2 2 2 2 \ 
012121202020101 
B = 102212100202110 

220110012211200 
V121022021001021/ 
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where 


I 入 i = 1 for x\ = X 2 

入 2 = 2 for x\ ^ X 2 □ 

Condition (C.l) can be expressed operationally as follows. Let in x the symbol 
i occur r/ times (0 < / < 5 - — 1). Then 

5—1 

J2 ri = t 

i=0 

and each of the t\/ro\r \!... ! vectors obtained by permuting x has the same 

X value. 

For the special case 5 = 2, the index set A/ consists of ， + 1 components , 入 o, 
入 1 ， …， \ u corresponding to the number of nonzero components in x, which is 
also referred to as the weight of x. 

There exist several methods of constructing B-arrays [see, e.g.，Chakravarti 
(1956, 1961)，Srivastava and Chopra (1973)，Aggarwal and Singh (1981)]. We 
shall mention here one method that is based on the existence of orthogonal arrays 
(Chakravarti, 1961). 

Suppose, an orthogonal array OA[N, K, s, t\ X] can be divided into two dis¬ 
joint arrays such that one array is a 5-array or a degenerate B-array [a degenerate 
B-array being one that has some but not all X(x\,X 2 , … ， x t ) equal to zero], 
with X(x\,X 2 ,... ,x t ) 〈入 for all f x 1 vectors x. Then the remaining array is a 
B-array with X parameters given by X*(x\,X 2 ,..., x t ) = ^ — k(x\,X 2 ,..., x t ). 
As an illustration consider the following example. 

Example C.4 Consider OA[9, 3, 3, 2; 1] 

/ 012012012 \ 

A = 0 1 2 2 0 1 1 2 0 

v 0 1 2 1 2 0 2 0 1 y 

Divide A into 



/ 0 

1 

2 \ 


/ 0 

1 

2 

0 

1 

2 \ 

Ai = 

0 

1 

2 

and A 2 = 

2 

0 

1 

1 

2 

0 


l 0 

1 

2 ) 


l 1 

2 

0 

2 

0 

1 / 


where Ai is a degenerate B-array and A 2 then represents a BA[6, 3, 3, 2] with 
入 1 = 0, 入 2 = 1 ， where 入 1 ， 入 2 are as defined in Example C.3. □ 


In other situations we may be able to obtain a B-array by deleting not only 
columns but also rows from an orthogonal array (see Chakravarti, 1961). 
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Selected Asymmetrical Balanced 
Factorial Designs 


1. 4x2x2, 灸 = 4 (Li, 1944) 



Replication 1 
Block 


Replication 2 
Block 


Replication 3 
Block 


D 

2 

3 

4 

1 

2 

3 

4 

1 

2 

3 

4 

000 

001 

010 

Oil 

000 

001 

010 

Oil 

000 

001 

010 

011 

101 

100 

111 

110 

111 

110 

101 

100 

110 

111 

100 

101 

210 

211 

200 

201 

201 

200 

211 

210 

211 

210 

201 

200 

311 

310 

301 

300 

310 

311 

300 

301 

301 

300 

311 

310 


入 (000) 

= 3 

/z(000) = 

2 

入 (100) 

= 0 

/z(100) = 

l 

4 

入 (010) 

= 0 

"(010) = 

0 

入 (110) 

=1 

"(110) = 

0 

入 (001) 

= 0 

/z(001) = 

0 

入 (101) 

=1 

h(l0l)= 

0 

入 (011) 

= 0 

/z(011)= 

1 

4 

入 (111) 

=1 

h(lll)= 



£(110) = £(101) = £(111) = §, all other E(z) = l,E = ^. 
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2. 4x2x2, ^ = 8 (Li, 1944) 


Replication 1 
Block 

Replication 2 
Block 

Replication 3 
Block 

1 

2 

1 

2 

1 

2 

000 

001 

000 

001 

000 

001 

Oil 

010 

Oil 

010 

Oil 

010 

100 

101 

101 

100 

101 

100 

111 

110 

110 

111 

110 

111 

201 

200 

201 

200 

200 

201 

210 

211 

210 

211 

211 

210 

301 

300 

300 

301 

301 

300 

310 

311 

311 

310 

310 

311 


入 (000) = 3 

h(000) = 

入 (100) = 1 

h(\00) = 

入 (010) = 0 

/z(010) = 

入 (110) = 2 

"(110) = - 

入 (001) = 0 

h(00l)= 

入 (101) = 2 

h(l0l)= - 

入 (011) = 3 

h(0ll)= - 

入 （ 111) = 1 

h(lll)= 


£(111) = f, all other E(z) = 1,E = ^. 
3. 4x4,k = 4 (Yates, 1937b) 



Replication 1 
Block 


Replication 2 
Block 

Replication 3 
Block 

1 

2 

3 

4 

1 

2 

3 

4 

1 

2 

3 

4 

00 

01 

02 

03 

00 

01 

02 

03 

00 

01 

02 

03 

11 

10 

13 

12 

12 

13 

10 

11 

13 

12 

11 

10 

22 

23 

20 

21 

23 

22 

21 

20 

21 

20 

23 

22 

33 

32 

31 

30 

31 

30 

33 

32 

32 

33 

30 

31 
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入 (00) = 3 h(00) = 2 

入 (10) = 0 h(\0)= 長 

入 (01) = 0 h(01) = i 

入 (11)= 1 h(U) = -\ 

£(10) = £(01) = 1, 五 (11) = §,£ = f. 

4 . 4 x 3 x 2, ^ = 12 (Li, 1944) 


Replication 1 
Block 

Replication 2 
Block 

Replication 3 
Block 

1 

2 

1 

2 

1 

2 

000 

001 

001 

000 

001 

000 

Oil 

010 

010 

Oil 

Oil 

010 

021 

020 

021 

020 

020 

021 

101 

100 

100 

101 

100 

101 

110 

111 

111 

110 

110 

111 

120 

121 

120 

121 

121 

120 

200 

201 

201 

200 

201 

200 

211 

210 

210 

211 

211 

210 

221 

220 

221 

220 

220 

221 

301 

300 

300 

301 

300 

301 

310 

311 

311 

310 

310 

311 

320 

321 

320 

321 

321 

320 


Replication 4 
Block 

Replication 5 
Block 

Replication 6 
Block 

1 

2 

1 

2 

1 

2 

001 

000 

000 

001 

000 

001 

010 

Oil 

Oil 

010 

010 

Oil 

020 

021 

020 

021 

021 

020 

100 

101 

101 

100 

101 

100 

111 

110 

110 

111 

111 

110 

121 

120 

121 

120 

120 

121 

200 

201 

201 

200 

201 

200 

211 

210 

210 

211 

211 

210 

221 

220 

221 

220 

220 

221 

301 

300 

300 

301 

300 

301 

310 

311 

311 

310 

310 

311 

320 

321 

320 

321 

321 

320 
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Replication 7 
Block 

Replication 8 
Block 

Replication 9 
Block 

1 

2 

1 

2 

1 

2 

000 

001 

001 

000 

001 

000 

Oil 

010 

010 

Oil 

Oil 

010 

021 

020 

021 

020 

020 

021 

100 

101 

101 

100 

101 

100 

111 

110 

110 

111 

111 

110 

121 

120 

121 

120 

120 

121 

201 

200 

200 

201 

200 

201 

210 

211 

211 

210 

210 

211 

220 

221 

220 

221 

221 

220 

301 

300 

300 

301 

300 

301 

310 

311 

311 

310 

310 

311 

320 

321 

320 

321 

321 

320 


入 (000) 

= 9 

/z(000) : 

23 

= T 



入 (100) 

= 3 

/z(100) : 

l 

= 3 



入卿 ) 

= 3 

/z(010) : 

1 

= 3 



入 (110) 

= 5 

"(110) : 

1 

= _ I2 



入 (001) 

= 0 

"(001) : 

2 

= 3 



入 (101) 

= 6 

"(101) : 

_ _ 1 



入 (Oil) = 6 "(011) = -士 

入 (111) = 4 h(lll) = 

£(101) = £(111) = §， all other E(z) = 1， £ = §. 

Using replicates 1, 2， 3 only will yield a design with the same relative 
efficiencies, but the design is not a BFD. 

5. 4 x 4 x 3, 灸 =12 (Li, 1944) 



Replication 1 
Block 



Replication 2 
Block 



Replication 3 
Block 


1 

2 

3 

4 

1 

2 

3 

4 

1 

2 

3 

4 

000 

010 

020 

030 

000 

030 

010 

020 

000 

020 

030 

010 

110 

100 

130 

120 

130 

100 

120 

110 

120 

100 

110 

130 

220 

230 

200 

210 

210 

220 

200 

230 

230 

210 

200 

220 
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Replication 1 



Replication 2 



Replication 3 



Block 



Block 



Block 


1 

2 

3 

4 

1 

2 

3 

4 

1 

2 

3 

4 

330 

320 

310 

300 

320 

310 

330 

300 

310 

330 

320 

300 

001 

Oil 

021 

031 

001 

031 

011 

021 

001 

021 

031 

Oil 

111 

101 

131 

121 

131 

101 

121 

111 

121 

101 

111 

131 

221 

231 

201 

211 

211 

221 

201 

231 

231 

211 

201 

221 

331 

321 

311 

301 

321 

311 

331 

301 

311 

331 

321 

301 

002 

012 

022 

032 

002 

032 

012 

022 

002 

022 

032 

012 

112 

102 

132 

122 

132 

102 

122 

112 

122 

102 

112 

132 

222 

232 

202 

212 

212 

222 

202 

232 

232 

212 

202 

222 

332 

322 

312 

302 

322 

312 

332 

302 

312 

332 

322 

302 




入 (000)= 

= 3 

/z(000) 

= 

3 







入 (100) = 0 

h(l00) 

= 

0 







入 (010) = 0 

h(0\0) 

= 

0 







入 (110) = 

=1 

h(U0) 

= 

0 







入 (001) = 3 

h(00l) 

= - 

_丄 







入 (101) = 0 

h(l0l) 

= 

1 

T2 







入 (011) = 0 

"(Oil) 

= 

1 

T2 







入 (111) = 

二 1 

h(lll) 

= - 

1 

'T2 






五 (110) : 

2 

- 3 5 

all other E(z) 

=l, E = ■ 

44 

47 - 




3 x : 

3 x 2 ,灸 

= 6 (Yates, 1937b) 







Replication 1 
Block 

Replication 2 
Block 

Replication 3 
Block 

Replication 4 
Block 

1 

2 

3 

1 

2 

3 

1 

2 

3 

1 

2 

3 

100 

200 

000 

200 

000 

100 

100 

200 

000 

200 

000 

100 

010 

020 

120 

020 

120 

010 

210 

010 

110 

010 

110 

210 

220 

110 

210 

110 

210 

220 

020 

120 

220 

120 

220 

020 

201 

001 

101 

101 

201 

001 

201 

001 

101 

101 

201 

001 

021 

121 

Oil 

Oil 

021 

121 

011 

111 

211 

211 

Oil 

111 

111 

211 

221 

221 

111 

211 

121 

221 

021 

021 

121 

221 
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入 (000) = 4 

入 (100) = 0 
入 (010) = 0 
A(110) = 2 
入 (001) = 0 
A(101) = 2 
入 (Oil) = 2 
入 (111) = 1 

五 (110) = | ，五 (lll) = f 


"( 000 ) = § 

h(lOO) = \ 

_ 0 ) = \ 
h(UO) = - 1 z 
h(00l) = \ 

h(m) = -I 

h(011) = -^ 

wiii) = 4 

all other E(z) = 1, E = ^ 


If only replications 1 and 2 or 3 and 4 are used, then the relative information 
for Ai x A 2 is I and Ai x A 2 x A 3 is but the design is not a BFD. 

7. 3 x 3 x 2 x 2, /: = 12 (Li ， 1944) 


Replication 1 
Block 

Replication 2 
Block 

1 

2 

3 

1 

2 

3 

0100 

0000 

0001 

0101 

0001 

0000 

0111 

0011 

0010 

0110 

0010 

0011 

0201 

0101 

0200 

0200 

0100 

0201 

0210 

0110 

0211 

0211 

0111 

0210 

1001 

1100 

1000 

1000 

1101 

1001 

1010 

1111 

1011 

1011 

1110 

1010 

1200 

1201 

1101 

1201 

1200 

1100 

1211 

1210 

1110 

1210 

1211 

1111 

2000 

2001 

2100 

2001 

2000 

2101 

2011 

2010 

2111 

2010 

2011 

2110 

2101 

2200 

2201 

2100 

2201 

2200 

2110 

2211 

2210 

2111 

2210 

2211 


Replication 3 
Block 

Replication 4 
Block 

1 

2 

3 

1 

2 

3 

0101 

0000 

0001 

0100 

0001 

0000 

0110 

0011 

0010 

0111 

0010 

0011 

0200 

0201 

0100 

0201 

0200 

0101 
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Replication 3 

Replication 4 


Block 


Block 


1 

2 

3 

1 2 

3 

0211 

0210 

0111 

0210 0211 

0110 

1001 

1101 

1000 

1000 1100 

1001 

1010 

1110 

1011 

1011 1111 

1010 

1100 

1200 

1201 

1101 1201 

1200 

1111 

1211 

1210 

1110 1210 

1211 

2000 

2001 

2101 

2001 2000 

2100 

2011 

2010 

2110 

2010 2011 

2111 

2201 

2100 

2200 

2200 2101 

2201 

2210 

2111 

2211 

2211 2110 

2210 


A. (0000) 

= 4 

/2(0000) = f 



M1000) 

= 0 

/j(1000) = i 



入 (0100) 

= 0 

ft (0100) = \ 



Mi ioo) 

= 2 

/i(1100) = -i 



A(0010) 

= 0 

_10) = \ 



入 (1010) 

= 2 

/zdOlO) = -i 



入 (0110) 

= 2 

/i(0110) = -i 



入 (1110) 

=1 

h(ino) = 士 



入 (0001) 

= 0 

h( 000 l) = l 



Miooi) 

= 2 

/i(lOOl) = -i 



入 (0101) 

= 2 

/i(0101) = -i 



入 (1101) 

= 1 

/2(1101) = 士 



入 _1) 

= 4 

/2(0011) = 



入 (1011) 

= 0 

h(mi)= i 



入 (0111) 

= 0 

h(Oin) = i 



入 (1111) 

= 2 

h(nn) = -\ 



£(1100) = l ， £(1111) = I，all other E(z) = 1. 


If only replications 1 and 2 or 3 and 4 are used, the relative information 
for Ai x A 2 remains | and for A\ x A2 x A3 x A 4 remains but these 
designs are not BFDs. 
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入 (000) = 5 /z(000)= 

入 (100) = 2 /z(100)= 

入 (010) = 0 "(010)= 

入 (110) = 3 "(110) = 

入 (001) = 0 "(001) = 

入 (101) = 3 "(101) = 

入 (011) = 5 /z(011)= 

入 （ 111) = 2 h(lll)= 


E(01l) = §, £(111) = all other E(z) = 1 ，五 = 


8. 5 x 2 x 2, 灸 =10 (Shah, 1960) 


Block 

123456789 10 


000 

010 

010 

010 

000 

010 

000 

000 

010 

000 

Oil 

001 

001 

001 

Oil 

001 

Oil 

Oil 

001 

Oil 

100 

100 

110 

110 

110 

100 

110 

100 

100 

110 

111 

111 

101 

101 

101 

111 

101 

111 

111 

101 

210 

200 

200 

210 

210 

210 

200 

210 

200 

200 

201 

211 

211 

201 

201 

201 

211 

201 

211 

211 

310 

310 

300 

300 

310 

300 

310 

300 

310 

300 

301 

301 

311 

311 

301 

311 

301 

311 

301 

311 

410 

410 

410 

400 

400 

400 

400 

410 

400 

410 

401 

401 

401 

411 

411 

411 

411 

401 

411 

401 


3811021106110111061101110310210 
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Exercises 


CHAPTER 1 

1.1 Consider the following design with t = 1 treatments in blocks of size 3 
and 4: 

Block 

1 2 3 4 5 6 7 

2 3 3 5 1 7 3 

4 5 6 4 6 2 1 

Treatments 

1 2 4 7 5 6 7 

2 4 13 

(a) Give the parameters for this design. 

(b) Write out the incidence matrix. 

(c) Write out the C matrix of (1.9). 

(d) Show that all differences X[ — Z[< are estimable. 

1.2 Show that CO = O for C of (1.9). 

1.3 The experiment of Exercise 1.1 was originally planned for blocks of size 
3, but it turned out that some of the blocks had actually 4 experimental 
units. The investigator decided to replicate one of the treatments assigned 
to those blocks. 

(a) Sketch the ANOVA table for this experiment, giving sources of varia¬ 
tion and d.f. 


Design and Analysis of Experiments. Volume 2: Advanced Experimental Design 

By Klaus Hinkelmann and Oscar Kempthorne 

ISBN 0-471-55177-5 Copyright © 2005 John Wiley & Sons, Inc. 
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(b) Discuss how the error d.f. can be divided into two sets and what the 
advantage of that would be. 

1.4 Consider the following incomplete block design with t = 6, b = 6, and 

k = 2: ‘ i 


Treatments 


Block 

1 2 3 4 5 6 

1 3 2 4 5 5 

2 1 3 6 6 4 


(a) Write out the C matrix. 

(b) Show that this is a disconnected design. 

(c) Identify a set of estimable functions of the treatment effects. 

1.5 Consider the following data obtained from an experiment using an incom¬ 
plete block design with t = 9, b = 6, k = 3, and 2 subsamples (the numbers 
in parentheses indicate the treatments): 



1 

53.5 , 54.8 

53.2,54.0 

57.7,56.4 



(6) 

(4) 

⑺ 


2 

53.1 , 52.7 

58.6,57.1 

53.9 ， 55.1 



(3) 

(1) 

(2) 


3 

57.2,56.5 

55.0,55.9 

51.5 ， 53.2 

Block 


(9) 

⑺ 

⑻ 

4 

53.7 , 52.9 

53.6,54.6 

57.9 , 56.8 



(4) 

(2) 

(9) 


5 

54.5 , 53.3 

52.8 , 53.2 

53.3 ， 55.0 



(3) 

⑺ 

(5) 


6 

48.9,47.8 

53.5 , 54.9 

56.7 ， 55.4 



⑻ 

(6) 

(1) 


(a) Write out a model for this data set. 

(b) Obtain the intrablock analysis using SAS. 

(c) Obtain the standard errors for all treatment comparisons r/ — r//. 

1.6 Consider an incomplete block design with t treatments, b blocks of size k, 
r replications per treatment, and m subsamples per experimental unit. 

(a) Explain how the derivations in Sections 1.8.2 and 1.8.3 need to be 
modified for this case. 

(b) Using the data from Exercise 1.5 obtain the combined analysis, includ¬ 
ing treatment comparisons. 

(c) Compare the results with those of Exercise 1.5. 
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CHAPTER 2 

2.1 Consider the BIB design 


Block Block 


1 

1 

4 

6 

7 

7 

2 

2 

6 

8 

9 

8 

3 

1 

3 

8 

9 

9 

4 

1 

2 

3 

4 

10 

5 

1 

5 

7 

8 

11 

6 

4 

5 

6 

9 

12 



Block 


2 3 6 7 

13 

12 4 9 

2 4 5 8 

14 

15 69 

3 5 7 9 

15 

13 6 8 

12 5 7 

16 

4 6 7 8 

2 3 5 6 

17 

3 4 5 8 

3 4 7 9 

18 

2 7 8 9 


(a) Give the parameters of this design. 

(b) Write out its C matrix. 

2.2 (a) Describe a scenario where you may want to use the design given in 

Exercise 2.1 but in such a form that each treatment occurs exactly 4 
times in the first 2 positions and 4 times in the last 2 positions over the 
18 blocks. 

(b) Write out the Latinized form of this design. 

(c) Give the SAS statements for analyzing data using the design in (b). 

2.3 (a) Show that the design given in Exercise 2.1 is a 4-resolvable BIB design, 
(b) Give the SAS statements for analyzing data using the design in (a). 

2.4 (a) Show that the design obtained in Exercise 2.3 can be Latinized. 

(b) Give the SAS statements for analyzing data using the design obtained 
in (i). 

2.5 Investigate whether the design given in Exercise 2.1 is locally resistant 
w.r.t. any of the treatments. 


CHAPTER 3 

3.1 Give a proof of Theorem 3.2. 

3.2 Obtain the complement of the BIB design given in Exercise 2.1 and verify 
that it is a BIB design. 

3.3 (a) Using the successive diagonalizing method construct a BIB for t = 25 

treatment in blocks of size 5. 

(b) Obtain the C matrix for this design. 

(c) Suppose that the last replicate is missing from the design. Investigate the 
properties of the residual design w.r.t. estimating treatment differences 
Ti — t ". 
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3.4 Construct the BIB design #25 in Table 3.1. 

3.5 Construct the BIB design #18 in Table 3.1. 

3.6 Construct the BIB design #22 in Table 3.1. 

CHAPTER 4 

4.1 Consider the rectangular PBIB design of Example 4.8. 

(a) Write out explicitly Eqs. (4.34) and obtain a solution of these equations. 

(b) Give explicit expressions for the variances of simple treatment com¬ 
parisons. 

(c) Generate data for this design and, using the g inverse obtained from 
SAS PROC GLM, verify the results obtained in (b). 

4.2 Write out the association scheme for the L4-PBIB(2) design for ? = 9 treat¬ 
ments. 

4.3 (a) Write out the association scheme for the EGD/(7)-PBIB design with 

t = IS treatments. 

(b) Give the parameters of this design. 

(c) Give the P matrices for this design. 

CHAPTER 5 

5.1 Verify that the design D f of Example 5.1 is a PBIB(2) design and give its 
parameters, including the P matrices. 

5.2 Using the PG(3, 2) construct a GD-PBIB(2) with t = 2 6 = \2 treatments 
in 8 blocks of size 6. 

5.3 Show that the dual of the BIB design #8 in Table 3.1 is a triangular PBIB(2) 
design. 

5.4 Construct a L 2 -PBIB(2) design for t = 25 treatments in 20 blocks of size 5. 

5.5 (a) Construct a cyclic PBIB design for t = 1 treatments mb = 1 blocks of 

size k = 4. 

(b) Obtain the efficiency factor of this design. 

(c) Check whether there exists a more efficient cyclic PBIB design than 
the design obtained in (a). 

5.6 Construct a balanced array BA[15, 6, 3, 2] from OA[18, 7, 3, 2; 2] and 
give its index set. 

5.7 Using the method of balanced arrays construct an EGD/3-PBIB design for 
t = 12 treatments in 12 blocks of size 3. 
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5.8 Using the method of balanced arrays construct an EGD/7-PBIB design for 
t = 12 treatments in 12 blocks of size 2 . 


CHAPTER 6 

6.1 (a) Construct an a design for ， = 15 treatments in r = 3 replicates and 

blocks of size k = 5. 

(b) Describe the properties of this design. 

(c) Obtain the efficiency factor of this design. 

6.2 Construct an a design for t = 19 treatments in blocks of size 5 and 4, with 
r = 2 replicates. 

6.3 Construct a generalized cyclic incomplete block design for t = 15 treat¬ 
ments in blocks of size 5. 

6.4 (a) Show that the design of Example 6.6 with r = 3 replicates is a (0, 1) 

design. 

(b) Obtain the efficiency factor of this design. 

6.5 Consider the BTIB design with t = 4 test treatments and blocks of size 3 
with generator designs 


/ 000000 \ 
GDi = 1112 2 3 

\ 2 3 4 3 4 4 / 


/ 0 0 1 1 1 1 2 2 \ 
GD 2 = 1 3 2 2 3 3 3 4 
\24344434/ 


/ 1 1 1 2 \ 
GD 3 = 2 2 3 3 

V 3 4 4 4 / 


(a) Using the generator designs given above, construct two BTIB designs 
with b = 12 blocks. 

(b) Give the parameters of both designs and compare their efficiency fac¬ 
tors. 

6.6 Consider the GD-PBIB(2) design with 6 treatments and 9 blocks of size 2: 


1 3 5 1 6 5 1 3 5 

246324462 
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(a) By inspection of the design above give its parameters and the associa¬ 
tion structure. 

(b) Using the PBIB design above, obtain a PBTIB design for f = 5 test 
treatments. 

(c) Give the parameters of the design obtained in (b). 

6.7 (a) Using the PBIB design of Exercise 6 . 6 , obtain a PTBIB design for 

t = 4 test treatments. 

(b) Give the parameters of this design. 

6.8 Consider the GD-PBIB(2) design for ^ = 6 treatments in b = 6 blocks of 
size 4: 


14 2 5 

2 5 3 6 

3 6 14 


4 15 2 

5 2 6 3 

6 3 4 1 


(a) Investigate the properties of this design as a row-column design. 

(b) Obtain the efficiency factor of the design obtained in (a). 


CHAPTER 7 

7.1 For the 2 4 factorial give explicit expressions for the true response of the 
treatment combination x r =(0 ， 1 ， 0 ， 1) using (7.42) and (7.49). 

7.2 For the 2 5 factorial write out explicitly the contrast that defines the 4-factor 
interaction ABCD. 

7.3 For the 2 3 factorial with r replications per treatment and m subsamples per 
experimental unit write out: 

(a) An expression for the interaction AB. 

(b) An expression for the sum of squares for AB. 

(c) The ANOVA table, including sources of variation, d.f. ， SS ， ^(MS). 

7.4 Consider the 2 6 factorial with one replication per treatment combination. 
Assume that all interactions involving three or more factors are 
negligible. 

(a) Write out the ANOVA table including sources of variation, d.f” SS, 

E(MS). ^ 

(b) Assume that there are m = 2 subsamples per treatment combination. 

Write out the ANOVA table, including sources of variation, d.f” SS, 
E(MS). ^ 

(c) For the situation described in (b) give the input statements for SAS 
PROC GLM and SAS PROC MIXED to perform both the intrablock 
and combined analyses. 
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CHAPTER 8 

8.1 (a) Determine the intrablock subgroup for the 2 5 factorial in blocks of size 

8 , confounding interactions involving three or more factors. 

(b) Write out the remaining blocks. 

8.2 Generate the design of Exercise 8.1 using SAS PROC FACTEX. 

8.3 Consider the design generated in Exercise 8.1 with r = 2 replications. 

(a) Assuming that all interactions involving three or more factors are neg¬ 

ligible, write out the ANOVA table, including sources of variation, d.f” 
SS, and 五 (MS). ‘ 

(b) Suppose that for each experimental unit we have m = 2 subsamples. 
Modify the ANOVA table in (a) appropriately. 

8.4 For the situations described in Exercise 8.3 give the SAS statements for 
performing the respective analyses, using SAS PROC GLM and SAS PROC 
MIXED. 


CHAPTER 9 

9.1 Consider the 2 5 factorial with 3 replicates and in blocks of size 4. Without 
confounding main effects construct a design that provides as much and as 
uniform as possible information about 2-factor interactions. 

9.2 For the design obtained in Exercise 9.1, assuming that 4 - and 5-factor 
interactions are negligible, give the ANOVA table, including sources of 
variation, d.f., SS, and £"(MS). 

9.3 For the design obtained in Exercise 9.1 give expressions for the combined 
estimators of all 2-factor interactions and the variances of these estimators. 

9.4 Discuss some possibilities of arranging a 2 6 factorial in two 8x8 squares, 
giving the actual designs and the relative information provided for main 
effects and 2-factor interactions. 


CHAPTER 10 

10.1 For the 3 4 factorial write out all main effects and interactions together 
with their 2-d.f. components. 

10.2 For the 3 4 factorial write out the parametrization (10.15) for the true 
response of the treatment combination x r = (0, 1, 1, 2). 

10.3 Construct a design for the 3 4 factorial in blocks of size 9 without con¬ 
founding main effects and 2-factor interactions. 
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10.4 Consider the 3 4 factorial in blocks of size 3. 

(a) Suppose two replicates are available. Using partial confounding, con¬ 
struct a design that obtains at least partial information on all main 
effects and 2-factor interaction components. 

(b) Give the relative information for all main effects and 2-factor interac¬ 
tion components. 

(c) Assuming that all 3 - and 4-factor interactions are negligible, sketch 
the ANOVA table for the design obtained in (a), giving sources of 
variation, d.f” SS, and £(MS). 

(d) Give the SAS statements for performing the analysis of an experiment 
using the design obtained in (a) for PROC GLM (intrablock analysis) 
and PROC MIXED (combined analysis). 


CHAPTER 11 

11.1 Consider the 5 2 factorial in 10 blocks of size 5. 

(a) Obtain a plan that yields full information on main effects and at least 
partial information on all interaction components. 

(b) Sketch the ANOVA table, including sources of variation, d.f., and SS. 

(c) Give the parametrization for a(l 1 1 1 1) —a (0 0 0 0 0) 

and give the variance of the estimator for this difference in yields. 

11.2 As an alternative to the design obtained in Exercise 11.1 consider the 
design based on the PBIB-L 2 design LS 51 (Clatworthy, 1973) given by 


Block 


1 

1 

2 

3 

4 

5 

2 

6 

7 

8 

9 

10 

3 

11 

12 

13 

14 

15 

4 

16 

17 

18 

19 

20 

5 

21 

22 

23 

24 

25 


Block 


6 

1 

6 

11 

16 

21 

7 

2 

7 

12 

17 

22 

8 

3 

8 

13 

18 

23 

9 

4 

9 

14 

19 

24 

10 

5 

10 

15 

20 

25 


(a) Set up a correspondence between the treatments 1, 2, ..25 and the 
factorial treatments (xi, X2, X3, X4, X5). 

(b) Explain the major difference between the two designs with respect to 
the estimation of main effects and interaction effects. 

(c) Explain how you would compare the overall efficiency of the two 
designs. 

11.3 Consider the 4 3 factorial in blocks of size 16. 

(a) Assuming that 3-factor interactions are negligible, obtain a plan that 
yields full information on main effects and 2-factor interactions. 
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(b) Suppose that 8 blocks are available for the experiment. Write out the 
ANOVA table, including sources of variation, d.f” and SS. 

11.4 (a) Apply Theorem 11.7 to the case of the 4 〃 factorial and write out the 

Fisher plan for the maximum number of factors. 

(b) For the plan obtained in (a) identify the resulting system of confound¬ 
ing. 

11.5 (a) For the 4 3 factorial in blocks of size 8 obtain a system of confounding 

with orthogonal factorial structure. 

(b) Obtain the efficiency factor of the design. 

CHAPTER 12 

12.1 Consider the 2 2 x 3 3 factorial in 18 blocks of size 6. 

(a) Assuming that interactions involving three or more factors are negli¬ 
gible, obtain a design using the Kronecker product method. 

(b) Explain why the design obtained in (a) is or is not satisfactory, and if 
it is not satisfactory what you might want to do to fix the problem. 

12.2 Consider the 2 2 x 3 3 factorial in blocks of size 6. 

(a) Using the GC/n method construct a system of confounding. 

(b) Obtain the loss of d.f. for the design obtained in (a). 

(c) Compare the design obtained in (a) with that obtained in Exercise 12.1. 

12.3 (a) Apply the method of finite rings to the 2 2 x 3 3 factorial in blocks of 

size 6. 

(b) Obtain the variance for the comparison between the treatments (1 ， 1 ， 
2 , 2, 2) and (0, 0, 0, 0, 0). 

12.4 Consider the 2 2 x 4 2 factorial in blocks of size 8. 

(a) Construct a balanced factorial design such that all treatment contrasts 
are estimable. 

(b) For the design of (a) obtain the efficiency factor. 

CHAPTER 13 

13.1 Consider a 2 8-2 design. 

(a) Give an identity relationship such that all main effects are strongly 
clear and all 2-factor interactions are either clear or strongly clear. 

(b) Obtain the design based on the identity relationship in (a). 

(c) Give the resolution of the design. 
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13.2 Consider the design in Exercise 13.1. Suppose that for the actual experi¬ 
ment blocks of size 16 need to be used. 

(a) Identify a suitable system of confounding. 

(b) Obtain the actual design. 

(c) For the design in (b) sketch the ANOVA table, including sources of 
variation and d.f” assuming that interactions involving four or more 
factors are negligible. 

13.3 Consider a 3^y 2 factorial based on the identity relationship I = ABCD = 
AB 2 EF = all GI. 

(a) Complete the identity relationship. 

(b) Write out the alias structure for all main effects and 2-factor interaction 
components. 

(c) Identify all the clear effects. 

(d) Obtain the actual design. 

13.4 For the design of Exercise 13.3 suppose that blocks of size 9 need to be 
used. 

(a) Obtain a suitable system of confounding and state any assumptions 
you are making. 

(b) For the arrangement in (a) sketch the ANOVA, giving sources of 
variation and d.f” assuming that all interactions involving three or 
more factors are negligible. 

13.5 Consider the 2jy 2 fraction in 4 blocks. 

(a) Using the independent defining words given in Table 13.12, obtain the 
actual design. 

(b) Suppose that the plan in (a) is replicated three times. Sketch the 
ANOVA, including sources of variation and d.f. 

(c) Rather than replicating the plan in (a) three times, discuss other pos¬ 
sibilities of carrying out the experiment. 

CHAPTER 14 

14.1 Construct a saturated orthogonal main effect plan for the 2 7 factorial. 

14.2 Construct an OMEP for the 3 4 factorial in 9 runs. 

14.3 (a) Construct an OMEP for the 2 4 factorial in 9 runs. 

(b) Identify the alias structure for this design. 

14.4 Using the OMEPs obtained in Exercises 14.1 and 14.2, construct an OMEP 
for the 2 7 x 3 4 factorial. 


14.5 (a) Construct an OMEP for the 4 2 x 3 x 2 2 factorial. 
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(b) Suppose the four- and three-level factors are quantitative factors with 
equidistant levels. Show that the estimators for the linear effects of a 
four-level factor and the three-level factor are uncorrelated. 

14.6 (a) Construct an OMEP for the 4 x 3 2 x 2 2 factorial. 

(b) Suppose the plan in (a) is replicated twice. Sketch the ANOVA table, 
including sources of variation and d.f. 

14.7 Consider the following main effect plan for the 2 4 factorial in five runs: 

Run A B C D 

1 0 0 0 0 
2 0 111 

3 10 11 

4 110 1 

5 1110 

(a) Is this an OMEP? 

(b) Show how you would estimate the main effects. 

(c) Obtain the efficiency factor of this design. 

CHAPTER 15 

15.1 Construct a supersaturated design for 14 factors in 8 runs. 

15.2 (a) Construct a supersaturated design for 20 factors in 12 runs. 

(b) Obtain E(s 2 ) for this design and compare it to the lower bound (15.9). 

15.3 (a) Construct a three-level supersaturated design for 12 factors in 12 runs, 
(b) Obtain the av x 2 and max x 2 values of (15.3) and (15.4), respectively. 

CHAPTER 16 

16.1 (a) Using the results of Table 16.1，write out the resolution III.l design 

for 行 = 5 factors. 

(b) Verify that the conditions of Theorem 16.5 are satisfied. 

(c) Show explicitly that, individually, each 2-factor interaction is estimable. 

16.2 (a) Obtain the resolution V.l design for n = 4 factors. 

(b) Verify that the conditions of Theorem 16.4 is satisfied. 

(c) Show explicitly that, individually, each 3-factor interaction is estimable. 
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16.3 Find the search probabilities for the design obtained in Exercise 16.1. 

16.4 Fine the search probabilities for the design obtained in Exercise 16.2. 

CHAPTER 17 

17.1 Consider the case of three design factors A\, A 2 , A 3 , each at three levels, 
and two noise factors By, B 2 , also each at three levels. 

(a) Construct the smallest cross array design such that all main effects are 
estimable. 

(b) Suppose that all five factors are quantitative factors and that only linear 
x linear interactions between A/ and Bj (i = 1, 2, 3; j = l, 2) are 
important. Sketch the ANOVA table for the design in (a), including 
sources of variation and d.f. 

17.2 Find an alternative single array design to the design obtained in Exercise 
17.1 that uses fewer runs. 

17.3 In certain cases when the design and noise factors have two levels it is 
possible to utilize a 2 n ~ t fractional factorial to construct a single array 
design. For example, for three design factors A\, A 2 , A 3 and three noise 
factors B\, B 2 , the 2 6-2 fraction based on 

I = A 1 A 2 A 3 B 2 = A 1 A 2 A 2 B 1 B 2 = B\B2 
yields a suitable design. 

(a) Write out the design in terms of control and noise factor level combi¬ 
nations. 

(b) Identify the design-noise factor interactions that are clear. 

17.4 (a) Use the method described in Exercise 17.3 to obtain a single array for 

four design factors and two noise factors, each at two levels. 

(b) Identify all the clear effects. 

(c) What additional assumptions will you have to make to estimate all 
main effects of the control factors and all control x noise factor inter¬ 
actions? 


CHAPTER 18 

18.1 (a) Construct a balanced square lattice for 25 treatments in blocks of size 
5. 

(b) Sketch the ANOVA table for this design, including sources of variation 
and d.f. 

(c) Show that this design is a BIB design. 
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18.2 (a) Construct a triple three-dimensional lattice for 64 treatments in blocks 

of size 4. 

(b) Sketch the ANOVA table for this design, including sources of variation 
and d.f. 

(c) Consider the B | T-ANOVA and indicate how you would estimate the 
weights w and w f . 

18.3 Construct a simple square lattice for 81 treatments in blocks of size 9. 

18.4 (a) Construct a balanced lattice square for 16 treatments. 

(b) Sketch the ANOVA table for this design, including sources of variation 
and d.f. 

18.5 Construct a triple rectangular lattice for 30 treatments in blocks of size 5. 

CHAPTER 19 

19.1 Construct a balanced CO(5, 10, 3). 

19.2 Consider the irreducible BIB design for 4 treatments in blocks of size 3. 
Use this design to construct the CO(4, 24, 3). 

19.3 (a) Consider the PBIB(2) design in (5.9) and use it as the generator design 

to construct the CO(6, 12, 3). 

(b) Obtain the variances for all simple treatment comparisons. 

19.4 Construct the strongly balanced CO(4, 8, 3). 

19.5 Suppose we have a 2 4 factorial treatment structure. Assume that the 
4-factor interaction is negligible. 

(a) Construct the CO(2 4 , 2 4 , 2 3 ). 

(b) Explain how you would estimate the direct main effects and 2-factor 
interactions. 
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right-angular, 151 
triangular, 139, 165 
Autocorrelation, 707 
Autoregression, first-order, 707 


Balanced incomplete block (BIB) design(s), 
2-4, 9, 52, 71-73, 89-92, 95, 100-120, 
136, 159, 168, 172, 175-180, 187, 
196-197, 201, 207, 211, 281, 313, 602, 
649, 653-654, 690 
affine resolvable, 93-95 
a-resolvable, 92-93, 96 
complement of, 113 
dual of, 158, 166 
irreducible, 113 
reinforced, 196 
residual, 114 
resistant, 98-102 
globally, 99-102 
locally, 99-100 
resolvable, 93, 96-97, 314 
robust, 103 
susceptible, 98-99 
symmetrical, 74, 105, 114, 711 
Balanced treatment incomplete block (BTIB) 
design(s), 199-204, 208 
optimal, 211-212 
R-type, 202 
5-type, 201-203 
Balancedness, 687, 695, 706-707 
Bar chart, 561 
Block(s) 

comparisons, 23, 283, 322 
effect(s), 3-4, 13, 16, 22, 279-281, 300, 399 
eliminating treatments, 13 
ignoring treatments, 13 
initial, 105-106, 110, 167-169, 193, 458, 
463-464, 602 
size(s), 1-2, 14, 279 
unequal, 13, 99, 189, 192 


Bound 
lower, 601 

upper, 47-48, 75, 192, 233, 

682-683 

Branching column, 599 

Chinese remainder theorem, 491 
Collapsing of factor levels, 567, 583, 586-587, 
591 

Component(s) 
pure block, 301 
treatment, 301, 421 
Comparison(s), see also Contrast(s) 
block, 23, 283, 322 
pure, 421 
orthogonal, 248 
post-hoc, 54, 64, 67 

treatment, 3-4, 23, 43, 171, 236, 284, 376, 
430 
Confidence 
coefficient, 212 
interval ⑻， 212 

Confounded, completely, 417-419 
Confounding, 279, 288, 368, 650 
complete, 312, 380, 417 
double, 335-336, 379 
in squares, 336 

partial, 312-314, 318, 324, 327, 331-334, 
343, 351, 375, 380-382, 492, 653 
system(s) of, 323, 329 
efficiency of, 323 
relationship(s), 511-513 
system(s) of, 258, 264, 283, 287-289, 

292-293, 298-300, 327, 362, 366, 
370, 374-375, 378-380, 398-405, 
430, 433, 443-444, 452-453, 
459-461, 466, 473-474, 480, 491, 
549-550, 564, 655-659, 713 
constructing, 457, 491 
detecting, 291 
symmetrical, 467 
Connectedness, 14, 103, 232 
Contrasts, see also Comparison(s) 
orthogonal, 243 
Control 

factor, 634-636, 639-641 
quality, 633 
off-line, 633 

treatment, see Treatment, control 
Criterion, see also Optimality 
average-variance, 49 
chi-squared (x 2 ), 598 
maximum, 604 
determinant, 49 
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E(s 2 ), 601 

smallest eigenvalue, 49 
Crossover design(s), 684-690, 715 
2-treatment, 693-695 
balanced, 687-688, 699, 702, 707-708 
completely, 709 
for residual effects, 687 
minimal, 688-689 
minimal completely, 709 
minimal strongly, 691-692 
partially, 691 

strongly, 687, 691-692, 699, 706, 710 
strongly uniform, 693 
totally, 712-713 
uniform, 692, 698-699 
construction of, 688, 708 
extra-period, 691, 699, 702, 712 
orthogonal, 709 
uniform, 693 

Cyclic development, 107-109, 193-194 

Dealiasing, 545, 569, 577, 609 
Defining equation(s), see Defining relationship 
Defining relationship, 511-513, 545. See also 
Identity relationship 

Design(s), see also Experiment(s); Plan(s) 
A-optimal, 199 
alpha (a), 190-192 

2- occurrence class, 190 

3- occurrence class, 191 
analysis of 

balanced incomplete block, 74 
complete block, 666-669 
disconnected incomplete block, 15 
incomplete block, 3 
binary, 71, 99, 194 
block 

balanced, 712 
extended, 196, 231 
central composite, 637-639 
changeover, see Crossover design(s) 
column, 213, 230-231, 237 
complete block, randomized, see Design(s), 
randomized complete block 
completely randomized (CRD), 43, 46, 263, 
278, 366 

component, 213, 230-234, 237 
computer search, 598 
confounded, 376, 407 
partially, 314, 317, 325 
connected, 43, 98, 230 
crossover, see Crossover design(s) 
cyclic, 169-170 
construction of, 167 


fractional, 169 
disconnected, 281, 460 
dual, 159 

efficient, 89, 239, 708 

equireplicate, 31, 34, 71, 99 

error control, 16, 43, 230, 241, 262, 272, 

366, 452, 685 
experimental, 1-3, 241 
construction of, 164 
factorial, 633. See also Factorial(s) 
asymmetrical, 547 
balanced (BFD), 491-506 
blocked fractional, 549, 555 
fractional, 507-509, 513, 517, 531, 
547-549, 636, 640 

minimum aberration blocked fractional, 

555 

resolution of a fractional, 518 
symmetrical, 394 
two-level, 241 
foldover, 543-544 
partial, 545 
generalized 

cyclic (GC/n), see also Design(s), 

incomplete block, generalized cyclic 
randomized block, 231 
Youden, 231, 712 
generator, 201-205, 212 
highly fractionated, 531 
incomplete block, 1-4, 9, 13, 23, 38, 43, 46, 
50, 91, 119, 123, 131, 169, 172, 189, 
194, 281, 310, 318, 323, 413, 448, 
491, 649, 657, 669 
ANOVA table of, 12 
balanced, see Balanced incomplete block 
(BIB) design(s) 

balanced treatment, see Balanced treatment 
incomplete block (BTIB) design(s) 
binary, 155 

disconnected, 14-15, 462 
disconnected resolved, 417 
doubly balanced, 102 
equireplicate, 120, 155, 452 
generalized cyclic, 193, 237, 448 
Latinized, 90 

partially balanced, see Partially balanced 
incomplete block (PBIB) design(s) 
partially balanced treatment, see Partially 
balanced treatment incomplete block 
(PTBIB) design(s) 
proper, 120, 155, 452 
resolvable, 189, 194-195, 649 
resolved, 418 
symmetrical, 73 
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Design(s) (Continued) 
incomplete Latin square, 91, 231 
Kronecker product, 172-178, 502, 529, 
548-549, 583, 636 

Latin square, 3, 50-51, 90, 263, 688. See 
also Square(s), Latin 
Latin square type, 1, 190, 706 
lattice(s), 3-4, 114, 119, 649-669. See also 
Lattice(s) 

maximum resolution, 518-520, 534-535 
minimum aberration, 520-522 
near-optimal, 192 
nonorthogonal, 563 

optimal, 49, 52, 211, 695-699, 711-713 
orthogonal, 271, 563 
paired comparison, 169 
process, 633 
proper, 31, 34, 71, 99 
quasi-factorial, 3, 114, 119, 649 
randomized complete block (RCBD), 1, 4, 
43, 46, 51, 238, 263, 267, 278, 281, 
300, 366-367, 666, 669 
random variables, 16, 18 
repeated measurements, see Crossover 
design(s) 
replicate, 238 
residual, 172 

resolution III, 516, 527-528, 537-539, 
543-546, 582. See also Resolution, III 
resolution IV, 516, 522, 537-539, 543. See 
also Resolution, IV 
resolution V, 516, 537, 546. See also 
Resolution, V 

resolvable, 114, 189, 238, 653 
resolved, 424 
response surface, 49, 637 
robust parameter, 634 
row, 213, 230-231, 234, 237 
row-column, 190, 213-216, 230-234, 237, 
240, 684-685, 712 
doubly incomplete, 230 
Latinized, 240 
nested, 238 
regular, 230-231, 237 
resolvable, 238, 682 
sampling, 272, 685 
saturated, 539, 546 
search, 608-614, 631-632 
constructing, 614 
sequential construction of, 614 
sequential, 544 
square, see Square(s) 
supersaturated, 596-608 
2-level, 598-599, 603 


3-level, 603 

balanced incomplete block (BIBD)-based, 
601 

Hadamard type, 599-601 
mixed level, 598 

supplemented balance, 196-197, 200 
switch-back, 693 
symmetric, 50 
treatment, 241, 272, 452 
treatment-control, 205 
treatment-control comparison, 198 
unconfounded, 323, 376 
variance-balanced, 98-99, 103 
weighing, 576 
Differences 
mixed, 108 
pure, 108 

symmetrically repeated, 107-109 
Difference set ⑻， 105-108, 112 

Effect ⑻ 

block, 3-4, 13, 16, 22, 281, 300, 399 
fixed, 4 
random, 4 

carryover, 685-686, 712 
mixed, 710 
self, 710, 713 
clear, 518, 539, 543 
strongly, 518 
column, 232 

direct, 685, 697-698, 708-713 
dispersion, 640 
estimable, 281 
fixed, 4, 706-707 

main, 241-242, 247, 250-251, 254, 

260-262, 265, 281, 287, 324, 359-360, 
365, 526, 546, 713 
period, 686, 706 
random, 707 

residual, 685-686, 697-699, 706-710, 713 
first-order, 708-709 
second-order, 708-709 
row, 232 
simple, 242, 249 
sparsity, 558, 605 
subject, 686, 707 

treatment, 3, 16, 22, 197, 232, 399, 

685-686, 706 

Effective number of replications, 79 
Efficiency(ies), 3, 168, 197, 334, 699 
factor(s), 43-48, 75, 103, 129-131, 134, 
171-172, 190-194, 198, 232, 236-238, 
498, 501， 682 
relative, 43 
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Eigenvalue(s) ， 8, 129-131, 155, 170, 198, 233 
Eigenvector(s), 8, 44, 233, 424 
Equation(s) 

Aitken, 27-28, 32 

normal (NE), 5, 10, 28, 134, 213-216, 696 
combined, 36 
interblock, 23-27 
reduced (RNE), 5-10, 13, 27, 74, 

119-121, 131, 170, 214-216, 238, 
320-321, 413-414, 452, 494, 539, 
697-699 
prediction, 641 
Error 

control, 213 

experimental, 1, 5, 14, 189, 214, 279-281, 
323, 610 

homogeneity of, 302 
interblock, 665-666 
intrablock, 3, 76, 665 
observational, 5, 214, 281, 610 
pseudostandard, 562 
rate(s), family wise, 605 
reduction, 213 
simultaneous margin of, 562 
variance, 76, 379 
whole-plot, 639 

Estimate(s), see Estimation; Estimator(s) 
Estimation 
capacity, 522 
combined, 31, 36 
index, 522 
interblock, 31 
intrablock, 31 

maximum-likelihood, 39, 648 
restricted (residual) maximum-likelihood 
(REML), 40-41, 648 
Estimator(s) 

Aitken, 38 

best linear unbiased (BLUE), 416-418, 
421-424 
intrablock, 3 

combined, 34, 38-39, 77-78, 155, 333, 
427-429, 651-653, 672 
interblock, 34-36, 303, 651-653 
intrablock, 34-39, 78, 133, 155, 303, 
651-653, 672 

residual (restricted) maximum-likelihood 
(REML), 40-42 
Shah, 155 

Yates, 37-39, 77-78, 96, 136 
Experiment(s), see also Design(s); Factorial(s) 

2 3 , 324 

2 4 , 327 

2 5 , 335 


2 n , 329 

3 2 , 377 

3 3 , 363, 367-370, 377 
3 5 , 402 

3 n , 363, 366 
confounded, 303 
cross-array, 640-641 
exploratory-type, 278 
factorial, 114, 279, 323, 657 
analysis of, 262 
confounded, 657 
field plot, 1 

robust-design, 633-637, 642, 646 
rotation, 685 
search, 617-619 
analysis of, 617 
single-array, 641 
split-plot type, 595 
supersaturated, 604-606 
analysis of, 604 
Experimental unit, 2, 74 
Experimentation, sequential, 648 

Factor(s) 

adjustment, 639-641, 648 
blocking, 189, 551-553 
control, 634-636, 639-641 
design, 634-636, 639-641, 646-648 
environmental, 634 
expanding, 587 
highest common, 458 
noise, 634-636, 640-641, 646-648 
pseudo, 431, 435, 441-443, 463, 467, 491, 
660 

qualitative, 362 
quantitative, 362 
random, 646 

scarcity of active, 596, 605 
treatment, 551 

Factor level(s), collapsing of, 567, 583, 
586-587, 591 

Factorial(s), see also Design(s); Experiment(s) 

2 2 , 650 

2 3 , 267, 272, 279, 282, 293 -295, 298, 

304-306, 309-311, 325-326, 
337-338, 343, 511, 714 

2 4 , 293, 296, 338, 350-351 
2 6 , 441 

(2 2 ) 3 , 440 

2 n , 264, 278, 283, 288-289, 293, 303, 312, 
329, 513 

3 2 , 359, 388 

3 3 , 366, 371 
3' 405 
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Factorial(s) (Continued) 

3 n , 368, 374, 380, 524 

4 2 , 431 

4 3 , 434-436, 441-442 
6 2 , 443-446, 460 

6 n , 443-444 

p n ， 394, 398-401, 405-406, 410-412, 428 
(p m ) n , 431 
p 細 ， 431 

s n , 431, 447, 452-454, 457, 465, 529, 558, 
564-565 

analysis of, 265, 343, 366, 412 
asymmetrical, 465-467, 474, 477, 480, 
483-485, 488, 505, 529, 558, 567, 
582, 586, 595 
balanced, 728-734 
confounding in, 466 
calculus, 448 

fractional, 264, 362, 507-509, 525, 
547-549, 576, 715 
asymmetrical, 548 
blocking in, 549 
nonorthogonal, 546 
regular, 517 
resolution of, 516 

mixed, see Factorial(s), asymmetrical 
symmetrical, 409, 431, 467, 477, 480, 488, 
517, 529, 564, 583, 593-595, 653 
confounding for, 447 
unreplicated, 558 
2 n , 558 
Field(s) 

finite, 481, 716 

Galois (GF), 112, 480-483, 716-718, 724 
theory, 112, 431 

Foldover principle, 543-545, 577 
Fraction(s) 
foldover, 544 
regular, 531 
Fractional 

replicate(s), 513, 517, 524, 531 
replication, 509-510, 516, 529 
Frequencies 

nonproportional, 594 
proportional, 567, 583-586 
Function(s) 

confounded, 417 

estimable, 8, 11, 38, 52, 75, 197, 215, 230, 
424, 541, 550-551, 699 
linearly independent, 462 
of treatment effects, 10, 35 
linearly independent, 421 
loss, 635, 639 
minimum, 129 


penalty, 606 

smoothly clipped absolute deviation 
(SCAD), 606 

Gauss-Markov theorem, 3, 10 
Generalized inverse(s), 7-10, 30, 61, 131, 198, 
541 

Generator(s), 401 
of the intrablock subgroup, 407 
Geometry(ies) 

Euclidean, 162, 409, 721-723 
finite, 158, 162, 409, 721 
projective, 162, 409, 721-724 
Group(s) 

Abelian, 105, 110, 128, 289, 458 
additive, 108-112 
cyclic, 688 
Guardrails, 559-560 

Half-normal plot(s), 558-560 
Harmonic mean, 45-46 
Heterogeneity 

eliminating, in two directions, 3, 89, 213 
two-way elimination of, 2, 162 
Hypercube 
of strength 2, 406 
of strength d, 406 

Ideal(s), 481-484 

Identity relationship(s), 513-518, 525-526, 
529-531, 544, 550, 554, 558, 637, 641 
Information, 280, 323, 334 
combined intra- and interblock, 154, 333, 
349, 386, 426, 653, 658 
function, 48 

interblock, 3, 26-27, 46, 77, 303, 308, 314, 
332, 385, 422-426, 707 
recovery of, 3-4, 22-23, 40 
intersubject, 707 

intrablock, 3, 26, 48, 308, 314, 330, 350, 
375, 381, 421-422, 426, 653, 658, 666 
intrasubject, 707 
matrix, see Matrix, information 
relative, 337, 501 
total relative loss in, 493 
within-row-and-column, 336 
Interaction(s), 241-243, 247, 251, 254, 
260-262, 265, 281, 286, 297, 359, 
360-365, 449 

1- factor, 297 

2- factor, 250, 280, 292, 324, 337, 361, 

373-374, 379, 395, 403-405, 434, 492, 
536, 539, 546, 608-609 
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3-factor, 250, 280, 337, 368, 378-379, 434, 
567-568, 608 
n-factor, 395, 493 

component(s), 366-369, 376, 412, 422, 493, 
658, 713 
modified, 487 

confounded, 279, 282, 401, 407, 425, 462, 
489 

design-noise factor, 647 
direct-residual treatment, 710 
generalized (GI), 258-260, 285-286, 364, 
396-399, 408, 433, 514, 525, 551 
high-order, 280, 508, 516, 608-609 
independent, 285-287, 292, 399-402, 407, 
433-434, 468, 514, 529, 551 
independent confounded, 374 
low-order, 287, 403, 513, 608 
mixed, 530-531 
negligible, 279, 614 
nonnegligible, 608 
partial, 261-265 

Intrablock subgroup (IBSG), 289-291, 370, 
374, 379, 400-401, 405-408, 434-435, 
484, 550, 565 

Lagrangian multipliers, 419 
Lattice(s) 

analysis of, 657-659 
balanced, 653-656, 659 
efficiency of, 669 
higher-dimensional, 654, 657 
one-restrictional, 654-655, 658, 668-669, 
682 

two-dimensional (square), 654-655, 670, 
679, 682-683 

three-dimensional (cubic), 654-656, 659, 
662 

quadruple, 654-656 
quintuple, 656 
rectangle ⑻， 678-679 
rectangular, 3, 194, 654-655, 679 
near-balanced, 654 
one-restrictional, 654 
simple, 654, 680 
triple, 654, 681-682 
septuple, 656 
sextuple, 656 
simple, 655-656, 670 
triple, 655-656, 659, 662 
two-restrictional, see Lattice square 
Lattice square, 239, 654, 671, 675-678 
balanced, 654, 671-672 
semibalanced, 654, 671 
unbalanced, 654, 675 


Least squares, 28 
generalized, 27 

means (LSM), 58, 69, 230, 271, 706 
direct, 707 
residual, 707 
ordinary, 4 
penalized, 605-606 
theory of, 3 

Likelihood function, 40-42 
log-, 41 
Loss, 633-635 
average, 636 
expected, 635 
function, 635, 639 
total relative, 501 

Matrix(ces) 

association, 120-121, 127, 173-174 
C ， see Matrix(ces), information 
circulant, 170, 459 
coefficient, 25, 121 
completely symmetric, 50 
concordance, 72, 155, 173, 452 
concurrence, 72, 190, 196, 199 
cyclic, 460 
design, 238, 612 
design-model, 413 

Hadamard, 575-576, 593, 599, 600, 603 
cyclic, 600 
noncyclic, 600 
normalized, 576 
seminormalized, 576, 599-600 
incidence, 5, 71, 113-114, 158, 173, 177, 
187, 452 

observation-block, 473 
period-residual effect, 686 
period-treatment, 686 
treatment-block, 473 
information, 7, 48, 129, 197-198, 232, 
236-238, 695-698, 708 
transformation, 244, 251 
variance-covariance, 11, 24, 75, 197, 383, 
424, 428, 496, 539, 549 
Maximum likelihood (ML), 64-66, 648 
residual (REML), 66-67, 81, 392, 648 
Method 

direct, 180, 188 
of differences, 158-160 
successive diagonalizing, 115, 194-195 
Taguchi, 633-637 
Minimum aberration, 555 
Missing plots, 13 
Model ⑻ 

general linear mixed, 40 
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Model(s) (Continued) 
linear, 4, 262 
mixed, 648 
search, 608-609 
location-scale, 639 
mixed, 333 
random effects, 40 
regression, 605, 647 
response surface, 647 
three-part, 313 
Modeling 

dispersion, 638, 641 
dual-response, 641 
location, 638, 641 

Noiseless, 610, 618-619 
Noisy, 625, 630 
Nonorthogonality, 3, 599, 605 
measure of, 599 

Optimal, see also Optimality 
A-, 197, 211 
blocking, 551 
E(s 2 )-, 601—603 
universally, 698, 712-713 
Optimality, 48-50, 601, 693, 708, 711 
A-, 49 

criterion, 48-49, 197 
D-, 49 
E-, 49 
Kiefer, 50 
MS-, 238 
MV-, 199 
universal, 50, 712 
Orthogonal series, 114 
Orthogonality, 284, 583, 593, 596-597, 691, 
713 

adjusted, 232, 237 
between-interaction, 494 
deviation from, 597-598 
measure of, 597-598 
near-, 596-597 
within-interaction, 494 

Parameter(s) 
of the first kind, 124 
of the second kind, 122-128 
Parameterization 

of treatment responses, 245, 256, 297, 365, 
410, 485 

non-full rank, 366 

Partially balanced incomplete block (PBIB) 
design ⑻， 3-4, 9, 52, 119, 122-127, 131 ， 


137, 158, 172-173, 207-209, 281, 467, 
649, 653, 669, 691 

2- associate class (PBIB/2), 129, 132, 

136-137, 155-156, 168, 172, 190 
cyclic, 137, 141, 157, 169 
group-divisible (GD), 137-138, 156-164, 
208, 502-504 
L 2 , 140, 166, 670 
L 3 , 670 
Li, 140, 670 
Latin square, 166 
Latin square type, 137, 140, 157 
regular group-divisible, 156 
semiregular group-divisible, 156 
singular group-divisible, 156 
triangular, 137-139, 156, 165 

3- associate class (PBIB/3), 169, 176, 190, 

231, 234, 670 
cubic, 137, 146, 670 
generalized group-divisible (GGD), 137, 
143 

generalized triangular, 137, 144 
rectangular, 137, 142, 147-148 

4- associate class (PBIB/4), 680 
generalized right-angular, 153 
right-angular, 137, 151 
resolved, 680 

m-associate class (PBIB/m), 127, 131, 134 
cubic, 149 

cyclic, 137, 167-170, 193, 460, 691 
extended group-divisible (EGD), 137, 
147-148, 178, 187, 466, 493, 
499-505 

EGD/(2 V -1), 147, 177-187 
EGD/3, 176-179, 186, 502 
EGD/7, 180 

generalized cyclic (GC/n), 458-460, 465, 
477-478 

hypercubic, 137, 149, 187-188 
Partially balanced treatment incomplete block 
(PTBIB) design(s), 205, 208-209, 230 
construction of, 208-209 
type I, 205-206 
type II, 207, 210-211 

Partition(s), 260, 363-364, 395-396, 401, 404, 
409 

cyclic, 454 
hierarchical, 457, 461 
mixed, 488, 536 
orthogonal, 255, 393 
pure, 487, 536 
Performance measure, 640 
Plan(s) 

Fisher, 403, 407, 564, 576 
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main effect(s) (MEP), 564, 608-609 
auxiliary, 586-589, 593 
constructing, 585-586 
Fisher, 564-566, 577, 597 
nonorthogonal, 594 
Plackett-Burman, 575-578, 597-599 
orthogonal (OMEP), 564-569, 575-577, 
582-595, 614 
saturated, 566, 594 
plus one (MEP.l), 611-614 
resolution V plus one, 614-617 
saturated, 594, 603, 611 
weak MEP.l, 612 
one-at-a-time, 594 
Polynomial 
irreducible, 718 
orthogonal, 361 
primitive, 115, 719 
Primitive 
mark, 717 

polynomial, 115, 719 
root, 112-113, 115, 717 
Procedure(s) 

ANOVA, 342 
computational, 52 
Dunnett’s, 199, 212 
maximum likelihood (ML), 63 
randomization, 16 

residual (restricted) maximum likelihood 
(REML), 63, 342 
SAS, 5^ SAS 
Satterthwaite, 387, 392 
Shah, 79 

variable selection, 605 
Yates, 37-38, 61-63, 70, 77, 386, 664 
Product 

Kronecker, 172, 177, 244, 251, 450, 469 
symbolic direct (SDP), 172-173, 186, 448, 
468 

symbolic inner (SIP), 179-180 
Projection, 577 
Proportionality condition, 585 
Pseudofactor(s), see Factor(s), pseudo 

Randomization, 595 
restricted, 338, 707 
test(s), see Test(s), randomization 
theory, 212 

Regression, 605, 641, 647 
all-subsets, 605 
linear, 605 

Repeated measurements, see Crossover 
design(s) 

Replication groups, 89, 96, 107 


Residue class, 193 
Resolution, 516 
2R, 517 
2/?+ 1, 517 
III, 516, 536 
nonorthogonal, 594 
orthogonal, 564, 582 

III, l, 611, 614 

IV, 516, 536, 577 

V, 516, 536, 614 
V.l, 614-617 
VII, 608 

maximum, 518, 520 
upper bound of, 534 
Resolvability, 92 
Response surface, 648 
Revealing power, 610 
Ring(s), 128 
finite, 468, 481-482 
Robustness, 103 

SAS 

PROC FACTEX, 293, 325, 338, 350, 370, 
435, 445, 469, 474, 511, 517, 520, 527, 
531 

PROC GLM, 52, 58, 81, 216, 267, 277, 311, 
338, 342, 350, 387, 391, 569, 618, 699 
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